CLASSIFICATION OF POLYPEPTIDES BY LIGAND GEOMETRY AND
RELATED METHODS



BACKGROUND OF THE INVENTION 



The present invention relates generally to
interactions between ligands and polypeptides and more
specifically to determining structure-related properties of
a ligand when bound to different polypeptides.



Structure determination plays a central role in
chemistry and biology due to the correlation between the
structure of a molecule and its function. Although a full
understanding of this correlation is not yet established,
one can gain insight into the function of a molecule from
its deduced structure. Thus, the structure can provide a
strong basis for formulating experiments to determine
function. Conversely, the eventual disclosure of a
structure for a well studied molecule can have a significant
effect in converging apparently disparate observations of
function into a consistent description of the molecule's
activity.

Practical applications which are becoming
increasingly dependent upon structure information include,
for example, the production of therapeutic drugs.
Therapeutic drugs can be designed by synthesizing a molecule
that mimics a ligand known to interact with a target
receptor. Alternatively, a therapeutic drug can be designed
by computer assisted methods in which a molecule is designed
to dock to a binding site on a receptor of known structure.
By structure-based methods such as these, lead compounds can
be identified for further development.



Using a similar structure-based approach, a
receptor can be engineered to yield improved or novel
functions. For example, changes can be made at a ligand
binding site in a polypeptide receptor based on the known
structure of the receptor. Given that a polypeptide
receptor can contain hundreds or even thousands of amino
acid residues, of which only a few may contact a ligand,
structural information is useful in identifying where
changes should be made in the polypeptide to alter ligand
binding. Polypeptide receptors engineered as such can be
used for a variety of practical applications including, for
example, industrial catalysis, therapeutics, and
bioremediation.



Although methods for structure determination are
evolving, it is currently difficult, costly and time
consuming to determine the structure of a polypeptide or
ligand. It can often be even more difficult to produce a
polypeptide-ligand complex in a condition allowing
determination of a structure for the bound complex.
Resorting to determining a structure for the receptor
individually can have limited value, particularly if the
location of ligand binding is difficult to identify due to
the large size of most polypeptide receptors. Similarly,
determination of a structure of an unbound ligand can have
limited usefulness because an unbound ligand has multiple
conformations and the most stable conformation of an unbound
ligand is often different from its conformation when bound
to a receptor.

Theoretical modeling of ligand-polypeptide
interactions is one alternative that has been attempted in
cases where the structure of the polypeptide-ligand complex
is not available. In this approach a ligand is fitted to a
structure of a polypeptide. The polypeptide structure used
can be determined empirically or theoretically. Theoretical
determination of a hypothetical molecular structure for a
polypeptide by ab initio methods is a relatively undeveloped
method. Another theoretical approach, referred to as
homology modeling, has been used to infer structure based on
comparison with molecules of known structure.

The successful application of homology modeling to
determining polypeptide-ligand interactions relies upon
choosing a correct polypeptide template for comparison. In
most cases criteria for comparison are unavailable or
unreliable. For example, it is common to produce a
hypothetical structure of a target polypeptide based on the
empirically determined structure of a template polypeptide
having similar sequence. However, similarities in sequence
do not always yield similar structures and conversely,
similar structures have been observed for two polypeptides
having significantly diverged sequences.

Thus, there exists a need for efficient methods to
identify properties of a ligand that confer binding
specificity for polypeptide receptors. A need also exists
for methods to classify polypeptides and ligands according
to structural characteristics. The present invention
satisfies this need and provides related advantages as well.

SUMMARY OF THE INVENTION

The invention provides a method for identifying a
pharmacocluster. The method includes the steps of (a)
determining bound conformations of a ligand bound to
different polypeptides, and (b) clustering two or more bound
conformations of the ligand having substantially the same
bound conformation, thereby identifying a pharmacocluster.
The invention also provides a method for identifying a
member of a pharmacocluster. The invention also provides a
method for identifying a polypeptide pharmacofamily. The
method includes the steps of (a) determining bound
conformations of a ligand bound to different polypeptides of
a polypeptide family, and (b) identifying two or more bound
conformations of the ligand having substantially different
bound conformations, thereby identifying at least two
polypeptide pharmacofamilies exhibiting binding specificity
for the two or more substantially different bound
conformations of the ligand.



BRIEF DESCRIPTION OF THE DRAWINGS

Figure 1 shows pharmacoclusters identified from a
database of 156 bound structures of nicotinamide adenine
dinucleotide or nicotinamide adenine dinucleotide phosphate.
Structures were generated using the overlay function in
INSIGHT98 (Molecular Simulations Inc., San Diego, CA).

Figure 2 shows the nomenclature used herein for
atom names in the NAD(P) molecule.

Figure 3 shows conformer models with interacting
atoms from bound polypeptide and ordered waters overlaid.
Models in parts A through H were derived from
pharmacoclusters 1-8, respectively, as described in the
Examples. Overlaid atoms and waters are identified as
either hydrogen bond donors (donors), hydrogen bond
acceptors (acceptors), sulfurs (sulfurs), waters (waters),
or atoms that can be hydrogen bond acceptors or hydrogen
bond donors (acceptors/donors) according to the legends
under each conformer model.



Figure 4 shows a portion of a 2D [1H,1H] NOESY
spectrum recorded with a 0.2 ml sample of 1 mM NADP and 200
µM of enzyme 1-deoxy-D-xylulose 5-phosphate reductoisomerase
(DOXP). Atoms are identified according to Figure 2. Spectra
are reported as parts per million (ppm). Since the ligand
is in fast exchange and is in excess over polypeptide, cross
peaks represent transferred NOEs.

Figure 5 shows high affinity binding of compound
TTE0001.001.A07 to polypeptide enzymes of pharmacofamily 1
(panel A) and pharmacofamily 8 (panel B). Double reciprocal
plots of reaction rate versus concentration of NADH (panel
A) or NADPH (panel B) are shown for each enzyme in the
presence of various concentrations of compound
TTE0001.001.A07. Concentrations of compound TTE0001.001.A07
shown to the right of plot A correspond to 7.1 µM (open
triangles), 3.6 µM (closed triangles), 1.8 µM (open circles)
and no added compound (closed circles). Concentrations of
compound TTE0001.001.A07 shown to the right of plot B
correspond to 56.2 µM (open triangles), 37.5 µM (closed
triangles), 18.7 µM (open circles) and no added compound
(closed circles). Inhibitory dissociation constants (K_is)
determined from the data are shown in the upper left corner
of the respective plot.



Figure 6 shows high affinity binding of compound
TTE0001.002.D02 to a polypeptide enzyme of pharmacofamily 1.
A double reciprocal plot of reaction rate versus
concentration of NADH is shown for the enzyme in the
presence of various concentrations of compound
TTE0001.002.D02. Concentrations of compound TTE0001.002.D02
shown to the right of the plot correspond to 20.6 µM (open
triangles), 13.7 µM (closed triangles), 6.9 µM (open
circles) and no added compound (closed circles). An
inhibitory dissociation constant (K_is) determined from the
data is shown in the upper left corner of the plot.

Figure 7 shows a pharmacophore model derived from
the coordinates presented in Table 3 for pharmacofamily 1.
Figure 7A shows a feature of the pharmacophore model
including a volume defining the shape of conformer model 1
which is indicated by grey spheres and superimposed on the
conformer model having coordinates listed in Table 3C.
Figure 7B shows three features of the pharmacophore model
including a hydrophobic region of the nicotinamide ring, a
hydrogen bond acceptor positioned at the averaged
coordinates for the location of 17 hydrogen bond acceptors
in the polypeptides of pharmacofamily 1, and a hydrogen bond
donor positioned where a hydrogen bond donor of a ligand
would be expected to have favorable interactions with
hydrogen bond acceptors observed in 11 of the 17
polypeptides in pharmacofamily 1. Figure 7C shows a
combination of features of Figures 7A and 7B present in a
pharmacophore model and superimposed on the conformer model.

DETAILED DESCRIPTION OF THE INVENTION 

The invention provides pharmacoclusters and
methods for identifying a pharmacocluster from bound
conformations of a ligand bound to different polypeptides.
The methods are applicable for identifying a
conformation-dependent property of a ligand based on bound
conformations of the ligand in a pharmacocluster. The
methods are also applicable for classifying polypeptides,
from a family of polypeptides that bind the same ligand, into
pharmacofamilies based on bound conformations of the ligand.
Accordingly, methods are provided for grouping polypeptides
into pharmacofamilies by determining bound conformations of
a ligand or a conformation-dependent property of a ligand
independent of a determination of the structure of the
polypeptide. An advantage of classifying polypeptides
according to bound conformations of a ligand is that a
pharmacofamily is likely to contain polypeptides having
greater binding specificity for a particular molecule than
other polypeptides in the same family. Thus, the methods
allow identification of a pharmacofamily that can
specifically interact with a particular therapeutic agent or
drug.

Additionally, the methods of the invention can be
used to determine a conformer model or pharmacophore model
based on a bound conformation or conformation-dependent
property of a ligand bound to polypeptides in a
pharmacofamily. The invention is therefore advantageous in
providing a model for the design and identification of
therapeutic compounds having specificity for a
pharmacofamily of polypeptides.



Another advantage of the invention is that the
methods provide a correlation between ligand conformation, a
parameter that is relatively easy to measure, and
polypeptide structure, a parameter of tremendous value but
often difficult to measure. Therefore, the methods of the
invention can be used to determine structural
characteristics of a polypeptide based on a
conformation-dependent property of a bound ligand.



As used herein, the term "pharmacocluster" refers
to a collection of substantially the same bound
conformations of a ligand, or portion thereof, bound to two
or more polypeptides. A member conformation of a
pharmacocluster can have (1) a conformation that is more
similar to an average conformation of the members in its
pharmacocluster than to any other pharmacocluster and (2) a
conformation that is more similar to an average conformation
of the members in its own pharmacocluster than the most
similar average structures from different pharmacoclusters
are to each other, wherein the pharmacoclusters consist of
conformations of the same ligand or portion thereof. The
pharmacocluster is determined for a ligand bound to
different polypeptides but does not require that a structure
of the polypeptide be known or included as part of a bound
conformation of a ligand. A bound conformation of a ligand
can include the entire ligand structure or selected atoms
including a portion of the complete atomic composition of
the ligand so long as the number of atoms provides
sufficient information to distinguish one pharmacocluster
from another. A pharmacocluster can include both the bound
conformations of a ligand, or portion thereof, and one or
more atoms that both interact with the ligand and are from a
bound polypeptide. Thus, a pharmacocluster can include
conformational information of 1 or more, 2 or more, 5 or
more, 10 or more, 20 or more, 30 or more, 40 or more, 50 or
more or 100 or more atoms of a ligand bound conformation.
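
By way of illustration only, the two membership criteria recited above can be sketched in code. The sketch assumes that all conformations and cluster averages have already been superimposed in a common coordinate frame with atoms in corresponding order, and it uses a plain coordinate RMSD as the similarity measure. The function names and the choice of RMSD are assumptions made for this example and are not taken from the disclosure.

```python
import math
from itertools import combinations

def rmsd(conf_a, conf_b):
    """Coordinate RMSD over corresponding atoms (assumes a shared frame)."""
    if len(conf_a) != len(conf_b) or not conf_a:
        raise ValueError("conformations need the same, nonzero atom count")
    total = sum(math.dist(p, q) ** 2 for p, q in zip(conf_a, conf_b))
    return math.sqrt(total / len(conf_a))

def satisfies_membership_criteria(conf, own_average, other_averages):
    """Check criteria (1) and (2) for a candidate member conformation.

    conf, own_average: lists of (x, y, z) tuples.
    other_averages: average structures of the other pharmacoclusters.
    """
    d_own = rmsd(conf, own_average)
    # (1) closer to its own cluster average than to any other average
    closer_to_own = all(d_own < rmsd(conf, avg) for avg in other_averages)
    # (2) closer to its own average than the two most similar cluster
    #     averages are to each other
    averages = [own_average] + list(other_averages)
    min_between = min(
        (rmsd(x, y) for x, y in combinations(averages, 2)),
        default=math.inf,  # only one cluster: criterion (2) is vacuous
    )
    return closer_to_own and d_own < min_between
```

A conformation lying close to its own cluster average passes both checks, whereas a conformation that coincides with the average of a different pharmacocluster fails criterion (1).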

Accordingly, portions of bound conformations of
two or more different ligands can be included in a ligand
pharmacocluster so long as the portions selected from each
ligand have a core bound conformation that is substantially
the same. A core bound conformation can consist of portions
of bound conformations of ligands wherein the portions have
identical structural formula and conformation. A core bound
conformation can also consist of portions of bound
conformations of ligands wherein the portions have different
structural formulas so long as the portions have
substantially the same conformation. The structural
formula, as it is understood in the art, is a 2 dimensional
representation of a molecule that identifies the atoms and
covalent bonds between each atom in the molecule. The
structural formula does not necessarily include information
sufficient to determine conformation of a molecule. For
example, a common structural formula representation of
cyclohexane can be a hexagon with 2 hydrogens attached to
each carbon being in equivalent positions. However, a
stable conformation of cyclohexane in solution may appear as
a "chair" or "boat" shape with hydrogens in either axial or
equatorial positions relative to the molecular plane.

As used herein, the term "conformation-dependent
property," when used in reference to a ligand, refers to a
characteristic of a ligand that specifically correlates with
the three dimensional structure of a ligand or the
orientation in space of selected atoms and bonds of the
ligand. Thus, a ligand bound to a polypeptide in a distinct
conformation will have at least one unique
conformation-dependent property correlated with the bound
conformation of the ligand. A conformation-dependent
property can be derived from or include the entire ligand
structure or selected atoms and bonds, including a fragment
or portion of the complete atomic composition of the ligand.
A conformation-dependent property that includes selected
atoms and bonds of a ligand can include 2 or more, 3 or more,
5 or more, 10 or more, 15 or more, 20 or more, 25 or more, or
50 or more atoms of a bound conformation of a ligand.

A characteristic that specifically correlates with
a three dimensional structure of a ligand is a
characteristic that is substantially different between at
least two different bound conformations of the same ligand
and, therefore, distinguishes the two different bound
conformations. A conformation-dependent property can
include a physical or chemical characteristic of a ligand,
for example, absorption and emission of heat, absorption and
emission of electromagnetic radiation, rotation of polarized
light, magnetic moment, spin state of electrons, or
polarity. A conformation-dependent property can also
include a structural characteristic of a ligand based, for
example, on an X-ray diffraction pattern or a nuclear
magnetic resonance (NMR) spectrum. A conformation-dependent
property can additionally include a characteristic based on
a structural model, for example, an electron density map,
atomic coordinates, or X-ray structure. A
conformation-dependent property can include a characteristic
spectroscopic signal based on, for example, Raman, circular
dichroism (CD), optical rotation, electron paramagnetic
resonance (EPR), infrared (IR), ultraviolet/visible
absorbance (UV/Vis), fluorescence, or luminescence
spectroscopies. A conformation-dependent property can also
include a characteristic NMR signal, for example, chemical
shift, J coupling, dipolar coupling, cross-correlation,
nuclear spin relaxation, transferred nuclear Overhauser
effect, or combinations thereof. A conformation-dependent
property can additionally include a thermodynamic or kinetic
characteristic based on, for example, calorimetric
measurement or binding affinity measurement. Furthermore, a
conformation-dependent property can include a characteristic
based on electrical measurement, for example, voltammetry or
conductance.

As used herein, "selected" conformation-dependent
properties are identified to form a set of
conformation-dependent properties that can include, for
example, the entire set of conformation-dependent properties
associated with the bound conformations of a ligand in a
pharmacocluster or a subset of conformation-dependent
properties associated with the bound conformations of a
ligand in a pharmacocluster, so long as the subset of
conformation-dependent properties is sufficient to identify
a unique conformation of the ligand. A selected
conformation-dependent property can include any of the above
described properties, for example, a physical or chemical
property, structural data, a structural model, a
spectroscopic signal, a thermodynamic or kinetic measurement
or an electrical measurement.



As used herein, the term "bound conformation,"
when used in reference to a ligand, refers to the location
of atoms of a ligand relative to each other in three
dimensional space, where the ligand is bound to a
polypeptide. The location of atoms in a ligand can be
described, for example, according to bond angles, bond
distances, relative locations of electron density, probable
occupancy of atoms at points in space relative to each
other, probable occupancy of electrons at points in space
relative to each other or combinations thereof.
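
As a non-limiting illustration of describing a bound conformation by bond distances and bond angles, the following sketch computes these quantities, together with a torsion angle, from atomic coordinates. The helper names are hypothetical, the torsion angle is an added descriptor not recited above, and the sign convention of the torsion angle is simply whatever the formula below produces.

```python
import math

def _sub(p, q):
    return tuple(a - b for a, b in zip(p, q))

def _dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def _cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def bond_distance(p, q):
    """Distance between two atom positions."""
    return math.dist(p, q)

def bond_angle(p, q, r):
    """Angle p-q-r in degrees, with q at the vertex."""
    u, v = _sub(p, q), _sub(r, q)
    cos_t = _dot(u, v) / (math.hypot(*u) * math.hypot(*v))
    # Clamp to guard against rounding slightly outside [-1, 1]
    return math.degrees(math.acos(max(-1.0, min(1.0, cos_t))))

def torsion_angle(p0, p1, p2, p3):
    """Torsion angle in degrees about the p1-p2 bond."""
    b0, b1, b2 = _sub(p0, p1), _sub(p2, p1), _sub(p3, p2)
    norm = math.hypot(*b1)
    b1 = tuple(c / norm for c in b1)
    # Components of b0 and b2 perpendicular to the central bond
    v = _sub(b0, tuple(_dot(b0, b1) * c for c in b1))
    w = _sub(b2, tuple(_dot(b2, b1) * c for c in b1))
    return math.degrees(math.atan2(_dot(_cross(b1, v), w), _dot(v, w)))
```

Because these internal coordinates do not depend on how the ligand is positioned or oriented in space, two bound conformations can be compared through them without first superimposing the structures.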



As used herein, a "selected" bound conformation
refers to a set of bound conformations that can include, for
example, the entire set of defined bound conformations or a
subset of bound conformations of a ligand.



As used herein, the term "clustering" refers to
assigning related bound conformations of a ligand, or
portion thereof, into a first collection such that the
conformations residing in the first collection can be
overlaid with substantial overlap and bound conformations
from two different collections cannot be overlaid with a
better overlap than that resulting from members of the first
collection. Exemplary clustering of ligand conformations
is disclosed herein (see Example I).
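
One way such clustering could be carried out is sketched below, purely as an example and not as the procedure of Example I. The sketch substitutes a distance-matrix RMSD (which compares interatomic distances and so needs no prior overlay, but cannot distinguish mirror images) for an explicit overlay, and it uses complete-linkage agglomerative merging with a cutoff. The function names, the linkage rule and the cutoff value are all assumptions introduced for illustration.

```python
import math
from itertools import combinations

def drmsd(conf_a, conf_b):
    """Distance-matrix RMSD between conformations with corresponding atoms."""
    pairs = list(combinations(range(len(conf_a)), 2))
    total = sum(
        (math.dist(conf_a[i], conf_a[j]) - math.dist(conf_b[i], conf_b[j])) ** 2
        for i, j in pairs
    )
    return math.sqrt(total / len(pairs))

def cluster_conformations(confs, cutoff):
    """Complete-linkage clustering; returns lists of indices into confs."""
    clusters = [[i] for i in range(len(confs))]

    def linkage(c1, c2):
        # Worst-case dissimilarity between any two members of the clusters
        return max(drmsd(confs[i], confs[j]) for i in c1 for j in c2)

    while len(clusters) > 1:
        x, y = min(
            combinations(range(len(clusters)), 2),
            key=lambda p: linkage(clusters[p[0]], clusters[p[1]]),
        )
        if linkage(clusters[x], clusters[y]) > cutoff:
            break  # no remaining pair of clusters is similar enough
        clusters[x] = clusters[x] + clusters[y]
        del clusters[y]  # x < y, so index x is unaffected
    return clusters
```

With complete linkage, every pair of conformations inside a cluster differs by no more than the cutoff, which loosely mirrors the requirement that members of one collection overlay better with each other than with members of another collection.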

As used herein, the term "ligand" refers to a
molecule that can specifically bind to a polypeptide.
Specific binding, as it is used herein, refers to binding
that is detectable over non-specific interactions by
quantifiable assays well known in the art. A ligand can be
essentially any type of natural or synthetic molecule
including, for example, a polypeptide, nucleic acid,
carbohydrate, lipid, amino acid, nucleotide or any organic
derived compound. The term also encompasses a cofactor or a
substrate of a polypeptide having enzymatic activity, or a
substrate that is inert to catalytic conversion by the bound
polypeptide. Specific binding to a polypeptide can be due
to covalent or non-covalent interactions.

As used herein, the term "bound to two or more
polypeptides," when used in reference to a ligand, is
intended to refer to two or more complexes consisting of a
ligand and a polypeptide. A complex can include, for
example, a single ligand bound to a single polypeptide. A
complex can also include a single ligand bound to more than
one polypeptide including, for example, a complex in which
a ligand is bound at the interface of interacting
polypeptides. A complex can also include multiple ligands;
however, conformation-dependent properties of all ligands of
the complex need not be identified. A complex results from
a specific interaction between a polypeptide and a ligand.

As used herein, the term "substantially the same,"
when used in reference to bound conformations of a ligand,
or portion thereof, is intended to refer to two or more
bound conformations that can be overlaid upon each other in
3 dimensional space such that all corresponding atoms
between the two conformations are overlapped. Accordingly,
"substantially different" bound conformations cannot be
overlaid upon each other in 3 dimensional space such that
all corresponding atoms between the two bound conformations
are overlapped.

As used herein, the term "polypeptide" is intended
to refer to a peptide polymer of two or more amino acids.
The term is similarly intended to include polymers
containing amino acid stereoisomers, analogues and
functional mimetics thereof. For example, derivatives can
include chemical modifications of amino acids such as
alkylation, acylation, carbamylation, iodination, or any
modification which derivatizes the polypeptide. Analogues
can include modified amino acids, for example,
hydroxyproline or carboxyglutamate, and can include amino
acids, or analogs thereof, that are not linked by peptide
bonds. Mimetics encompass chemicals containing chemical
moieties that mimic the function of the polypeptide
regardless of the predicted three-dimensional structure of
the compound. For example, if a polypeptide contains two
charged chemical moieties in a functional domain, a mimetic
places two charged chemical moieties in a spatial
orientation and constrained structure so that the
corresponding charge is maintained in three-dimensional
space. Thus, all of these modifications are included within
the term "polypeptide" so long as the polypeptide retains
its binding function.

As used herein, the term "root mean square
deviation," or RMSD, refers to a standard deviation which
quantifies the structural variability in a population of
bound conformations of a ligand. The term is intended to be
consistent with its meaning as understood in the art as
described for example in Doucet and Weber, Computer-Aided
Molecular Design: Theory and Applications, Academic Press,
San Diego, CA (1996).
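
For illustration, an RMSD of the kind referred to above can be computed as sketched below. The sketch assumes that the conformations have already been overlaid in a common coordinate frame and list their atoms in corresponding order. The function names are hypothetical, and the population form shown (deviation of every atom from the mean structure) is one common reading of "structural variability in a population" rather than a formula taken from the cited reference.

```python
import math

def pairwise_rmsd(conf_a, conf_b):
    """RMSD between two overlaid conformations (lists of (x, y, z))."""
    if len(conf_a) != len(conf_b) or not conf_a:
        raise ValueError("conformations need the same, nonzero atom count")
    total = sum(math.dist(p, q) ** 2 for p, q in zip(conf_a, conf_b))
    return math.sqrt(total / len(conf_a))

def population_rmsd(confs):
    """RMSD of all atoms in a population about the mean structure."""
    n_confs, n_atoms = len(confs), len(confs[0])
    # Mean position of each atom across the population
    mean = [
        tuple(sum(c[i][k] for c in confs) / n_confs for k in range(3))
        for i in range(n_atoms)
    ]
    total = sum(
        math.dist(c[i], mean[i]) ** 2 for c in confs for i in range(n_atoms)
    )
    return math.sqrt(total / (n_confs * n_atoms))
```

For two single-atom conformations located 2 units apart, the pairwise value is 2 and the population value about the mean is 1.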



As used herein, the term "family," when used in
reference to characterizing polypeptides having ligand
binding activity, is intended to refer to polypeptides that
can bind to the same ligand, or portion thereof. A
polypeptide family can contain polypeptides having binding
activity for a common ligand with sufficient affinity,
avidity or specificity to allow measurement of the binding
event. As defined herein a "member" of a polypeptide family
refers to an individual polypeptide that can be classified
in a polypeptide family because the polypeptide binds a
ligand, or portion thereof, that binds another polypeptide
in a polypeptide family. The bound conformations of a
ligand bound by individual members of a family can be
substantially the same or different from each other.

As used herein, the term "pharmacofamily," when
used in reference to polypeptides, is intended to refer to
polypeptides that can be classified together in a population
because they individually bind a ligand such that the ligand
is bound in substantially the same conformation. As defined
herein a "member" of a polypeptide pharmacofamily refers to
an individual polypeptide that is classified in a
polypeptide pharmacofamily because the polypeptide binds a
conformation of a ligand that is substantially the same as a
conformation of the ligand bound to another polypeptide in
the pharmacofamily.

As used herein, the term "grouping" refers to
assigning related polypeptides into a family or
pharmacofamily such that the polypeptide members of a family
bind the same ligand and the polypeptide members of a
pharmacofamily bind substantially the same bound
conformation of a ligand.
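
The distinction between a family and a pharmacofamily can be illustrated with a short sketch. It assumes that each polypeptide-ligand complex has already been assigned a pharmacocluster label by some clustering step. The record layout, the function name and the polypeptide names in the usage note are hypothetical.

```python
from collections import defaultdict

def group_polypeptides(records):
    """Group polypeptides into families and pharmacofamilies.

    records: iterable of (polypeptide, ligand, pharmacocluster_label).
    Returns (families, pharmacofamilies), where families maps each ligand
    to the set of polypeptides binding it, and pharmacofamilies maps each
    (ligand, pharmacocluster_label) pair to the set of polypeptides that
    bind the ligand in that conformation.
    """
    families = defaultdict(set)
    pharmacofamilies = defaultdict(set)
    for polypeptide, ligand, label in records:
        families[ligand].add(polypeptide)
        pharmacofamilies[(ligand, label)].add(polypeptide)
    return dict(families), dict(pharmacofamilies)
```

For example, three hypothetical enzymes that all bind NAD, two of them in conformation 1 and one in conformation 2, form a single family and two pharmacofamilies.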



As used herein, the term "fold," when used in
reference to a polypeptide, refers to a specific geometric
arrangement and connectivity of a combination of secondary
structure elements in a polypeptide structure. Secondary
structure elements of a polypeptide that can be arranged
into a fold including, for example, alpha helices, beta
sheets, turns and loops are well known in the art. Folds of
a polypeptide can be recognized by one skilled in the art
and are described in, for example, Branden and Tooze,
Introduction to Protein Structure, Garland Publishing, New
York (1991) and Richardson, Adv. Prot. Chem. 34:167-339
(1981).



As used herein, "modeling the three dimensional
structure" when used in reference to a polypeptide refers to
determining a conformation for a polypeptide. A
conformation of a polypeptide can be determined, for
example, from empirical data specifying structure or from a
compared conformation used as a template. A conformation
can be determined at any desired level of resolution
sufficient to identify, for example, overall shape of a
polypeptide, tertiary structure elements, secondary
structure elements, polypeptide backbone structure, amino
acid residue identity or location of individual atoms.

As used herein, the term "structural model," when
used in reference to a polypeptide, refers to a
representation of a 3 dimensional structure of a
polypeptide. A structural model can be determined from
empirical data derived from, for example, X-ray
crystallography or nuclear magnetic resonance spectroscopy.
A structural model can also be derived from a theoretical
calculation including, for example, comparison to a known
structure or ab initio molecular modeling. A representation
of a structural model can include, for example, an electron
density map, atomic coordinates, X-ray structure model, ball
and stick model, density map, space filling model, surface
map, Connolly surface, van der Waals surface or CPK model.

As used herein, the term "conformer model" refers
to a representation of points in a defined coordinate system
wherein a point corresponds to a position of an atom in a
bound conformation of a ligand. The coordinate system is
preferably in 3 dimensions; however, manipulation or
computation of a model can be performed in 2 dimensions or
even 4 or more dimensions in cases where such methods are
preferred. A point in the representation of points can, for
example, correlate with the center of an atom.
Additionally, a point in the representation of points can be
incorporated into a line, plane or sphere to include a shape
of one or more atoms or volume occupied by one or more atoms.
A conformer model can be derived from 2 or more bound
conformations of a ligand. For example, a conformer model
can be generated from 3 or more, 4 or more, 5 or more, 6 or
more, 7 or more, 8 or more, 10 or more, 15 or more, 20 or
more or 25 or more bound conformations of a ligand.

As used herein, the term "average structure," when
used in reference to bound conformations of a ligand in a
pharmacocluster, refers to a conformer model derived by
superimposing the bound conformations of a ligand in a
pharmacocluster and determining an average location in
space for corresponding atoms.
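
The averaging step can be illustrated as follows. The sketch assumes that the superimposing step has already been performed, so that all conformations share one coordinate frame and list their atoms in corresponding order. The function name is hypothetical.

```python
def average_structure(confs):
    """Average location of corresponding atoms across overlaid conformations.

    confs: list of conformations, each a list of (x, y, z) tuples.
    Returns one list of (x, y, z) tuples, one per atom.
    """
    if not confs:
        raise ValueError("at least one conformation is required")
    n_atoms = len(confs[0])
    if any(len(c) != n_atoms for c in confs):
        raise ValueError("all conformations need the same atom count")
    return [
        tuple(sum(c[i][k] for c in confs) / len(confs) for k in range(3))
        for i in range(n_atoms)
    ]
```

Note that an averaged position need not correspond to a chemically reasonable geometry; bond lengths in an average structure can be shorter than in any member when the members differ substantially.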

As used herein, the term "pharmacophore model"
refers to a representation of points in a defined coordinate
system wherein a point corresponds to a position or other
characteristic of an atom or chemical moiety in a bound
conformation of a ligand and/or an interacting polypeptide
or ordered water. An ordered water is an observable water
in a model derived from structural determination of a
polypeptide. A pharmacophore model can include, for
example, atoms of a bound conformation of a ligand, or
portion thereof. A pharmacophore model can include both the
bound conformations of a ligand, or portion thereof, and one
or more atoms that both interact with the ligand and are
from a bound polypeptide. Thus, in addition to geometric
characteristics of a bound conformation of a ligand, a
pharmacophore model can indicate other characteristics
including, for example, charge or hydrophobicity of an atom
or chemical moiety. A pharmacophore model can incorporate
internal interactions within the bound conformation of a
ligand or interactions between a bound conformation of a
ligand and a polypeptide or other receptor including, for
example, van der Waals interactions, hydrogen bonds, ionic
bonds, and hydrophobic interactions. A pharmacophore model
can be derived from 2 or more bound conformations of a
ligand. For example, a conformer model can be generated from
3 or more, 4 or more, 5 or more, 6 or more, 7 or more, 8 or
more, 10 or more, 15 or more, 20 or more or 25 or more bound
conformations of a ligand.



A point in a pharmacophore model can, for example,
correlate with the center of an atom or moiety.
Additionally, a point in the representation of points can be
incorporated into a line, plane or sphere to indicate a
characteristic other than a center of an atom or moiety
including, for example, shape of an atom or moiety or volume
occupied by an atom or moiety. The coordinate system of a
pharmacophore model is preferably in 3 dimensions; however,
manipulation or computation of a model can be performed in 2
dimensions or even 4 or more dimensions in cases where such
methods are preferred. Multidimensional coordinate systems
in which a pharmacophore model can be represented include,
for example, Cartesian coordinate systems, fractional
coordinate systems, or reciprocal space. The term
pharmacophore model is intended to encompass a conformer
model.
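
As a non-limiting sketch, a pharmacophore model of the kind described can be held in a small data structure in which each point carries a characteristic (for example, hydrogen bond acceptor or hydrophobic region) and a position. The class name, field names, feature labels and the per-feature tolerance sphere are assumptions introduced for this example, and the matching rule shown is only one simple possibility.

```python
import math
from dataclasses import dataclass

@dataclass(frozen=True)
class Feature:
    kind: str               # e.g. "acceptor", "donor", "hydrophobic"
    position: tuple         # (x, y, z) in the model's coordinate frame
    tolerance: float = 1.0  # radius of the sphere around the point

def matches(model, candidate):
    """True if every model feature is satisfied by some candidate feature.

    model, candidate: lists of Feature in the same coordinate frame.
    A model feature is satisfied when a candidate feature of the same
    kind lies within the model feature's tolerance sphere.
    """
    return all(
        any(
            c.kind == f.kind
            and math.dist(c.position, f.position) <= f.tolerance
            for c in candidate
        )
        for f in model
    )
```

A candidate compound whose features have been placed in the model frame could then be screened by calling matches(model, candidate). Placing the candidate in that frame is a separate alignment problem that this sketch does not address.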



As used herein, the term "moiety" refers to a
group of atoms that form a part or portion of a larger
molecule. A moiety can consist of any number of atoms in a
portion of a ligand and can correlate with a physical or
chemical property conferred upon the ligand by the combined
atoms. Exemplary moieties of a nicotinamide adenine
dinucleotide ligand include a phosphate, nicotinamide ring,
amino group, amide group or ribose ring. In addition, a
nicotinamide adenine dinucleotide group can be a moiety.
For example, a nicotinamide adenine dinucleotide can be a
moiety of the 2'P phosphate in a nicotinamide adenine
dinucleotide phosphate molecule (see Figure 2 for location
of the 2'P phosphate in nicotinamide adenine dinucleotide
phosphate).



The invention provides a method for identifying a 
pharmacocluster. The method includes the steps of (a) 
determining bound conformations of a ligand bound to 
different polypeptides, and (b) clustering two or more bound 
conformations of the ligand having substantially the same 
bound conformation, thereby identifying a pharmacocluster. 
The invention also provides a method for identifying a 
member of a pharmacocluster. The method includes the steps 
of (a) determining a bound conformation of a ligand bound to 
a polypeptide; and (b) determining a pharmacocluster having 
substantially the same bound conformation as the bound 
conformation, thereby identifying the bound conformation of 
the ligand as a member of the pharmacocluster. 



A bound conformation of a ligand bound to a 
polypeptide can be determined from a previously observed 
molecular structure or from data specifying a molecular 
structure for a bound conformation of a ligand. Previously 
observed structures can be acquired for use in the invention 
by searching a database of existing structures. An example 
of a database that includes structures of bound 
conformations of ligands bound to polypeptides is the 
Protein Data Bank (PDB, operated by the Research 
Collaboratory for Structural Bioinformatics, see Berman et 
al., Nucleic Acids Research 28:235-242 (2000)). A database 
can be searched, for example, by querying based on chemical 
property information or on structural information. In the 
latter approach, an algorithm based on finding a match to a 
template can be used as described, for example, in Martin, 
"Database Searching in Drug Design," J. Med. Chem. 
35:2145-2154 (1992). 



A bound conformation of a ligand bound to a 
polypeptide can be determined from an empirical measurement, 
or from a database. Data specifying a structure can be 
acquired using any method available in the art for 
structural determination of a ligand bound to a polypeptide. 
For example, X-ray crystallography can be performed with a 
crystallized complex of a polypeptide and ligand to 
determine a bound conformation of the ligand bound to the 
polypeptide. Methods for obtaining such crystal complexes 
and determining structures from them are well known in the 
art as described, for example, in McRee et al., Practical 
Protein Crystallography, Academic Press, San Diego (1993); 
Stout and Jensen, X-ray Structure Determination: A Practical 
Guide, 2nd Ed., Wiley, New York (1989); and McPherson, The 
Preparation and Analysis of Protein Crystals, Wiley, New 
York (1982). Another method useful for determining a bound 
conformation of a ligand bound to a polypeptide is Nuclear 
Magnetic Resonance (NMR). NMR methods are well known in the 
art and include those described, for example, in Reid, 
Protein NMR Techniques, Humana Press, Totowa NJ (1997); and 
Cavanagh et al., Protein NMR Spectroscopy: Principles and 
Practice, ch. 7, Academic Press, San Diego CA (1996). 



A bound conformation of a ligand can also be 
determined from a hypothetical model. For example, a 
hypothetical model of a bound conformation of a ligand can 
be produced using an algorithm which docks a ligand to a 
polypeptide of known structure and fits the ligand to the 
polypeptide binding site. Algorithms available in the art 
for fitting a ligand structure to a polypeptide binding site 
include, for example, DOCK (Kuntz et al., J. Mol. Biol. 
161:269-288 (1982)) and INSIGHT98 (Molecular Simulations 
Inc., San Diego, CA). 

A molecular structure can be conveniently stored 
and manipulated using structural coordinates. Structural 
coordinates can occur in any format known in the art so long 
as the format can provide an accurate reproduction of the 
observed structure. For example, crystal coordinates can 
occur in a variety of file types including, for example, 
.fin, .df, .phs, or .pdb as described, for example, in McRee, 
supra. Although the examples above describe structural 
coordinates derived from X-ray crystallographic analysis or 
NMR spectroscopy, one skilled in the art will recognize that 
structural coordinates can be derived from any method known 
in the art to determine a bound conformation of a ligand 
bound to a polypeptide. 
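
By way of illustration only, structural coordinates stored in 
the .pdb file type can be read with a short routine such as the 
following Python sketch. The function names parse_ligand_atoms 
and format_hetatm, and the residue name NAD used in the demo, 
are assumptions introduced here for illustration and are not 
part of any program referenced above. The sketch assumes the 
standard fixed-column PDB layout in which ligand atoms appear 
as HETATM records. 

```python
def format_hetatm(serial, atom, res, chain, resseq, x, y, z):
    """Build one fixed-column HETATM record (hypothetical demo helper)."""
    return "HETATM%5d %-4s %3s %1s%4d    %8.3f%8.3f%8.3f" % (
        serial, atom, res, chain, resseq, x, y, z)


def parse_ligand_atoms(pdb_text, ligand_name):
    """Return [(atom_name, x, y, z), ...] for HETATM records of one ligand.

    Column positions follow the fixed-width PDB coordinate record:
    atom name in columns 13-16, residue name in 18-20, and the x, y, z
    coordinates in columns 31-38, 39-46 and 47-54.
    """
    atoms = []
    for line in pdb_text.splitlines():
        if not line.startswith("HETATM"):
            continue  # skip polypeptide ATOM records and all other records
        if line[17:20].strip() != ligand_name:
            continue  # skip waters, ions and other hetero groups
        atom_name = line[12:16].strip()
        x = float(line[30:38])
        y = float(line[38:46])
        z = float(line[46:54])
        atoms.append((atom_name, x, y, z))
    return atoms


if __name__ == "__main__":
    demo = "\n".join([
        format_hetatm(1, "PA", "NAD", "A", 400, 1.0, 2.0, 3.0),
        format_hetatm(2, "O", "HOH", "A", 500, 9.0, 9.0, 9.0),
    ])
    for atom in parse_ligand_atoms(demo, "NAD"):
        print(atom)
```

Only the ligand coordinates are retained in this sketch, which 
is consistent with methods that use the structure of the ligand 
without the structure of the polypeptide. 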

Structures at atomic level resolution can be 
useful in the methods of the invention. Resolution, when 
used to describe molecular structures, refers to the minimum 
distance that can be resolved in the observed structure. 
Thus, resolution where individual atoms can be resolved is 
referred to in the art as atomic resolution. Resolution is 
commonly reported as a numerical value in units of Angstroms 
(Å, 10^-10 meter) correlated with the minimum distance which 
can be resolved such that smaller values indicate higher 
resolution. Bound conformations of a ligand useful in the 
methods of the invention can have a resolution better than 
about 10 Å, 5 Å, 3 Å, 2.5 Å, 2.0 Å, 1.5 Å, 1.0 Å, 0.8 Å, 0.6 
Å, 0.4 Å, or about 0.2 Å or better. Resolution can also be 
reported as an all atom RMSD as used, for example, in 
reporting NMR data. Bound conformations of a ligand useful 
in the methods of the invention can have an all atom RMSD 
better than about 10 Å, 5 Å, 3 Å, 2.5 Å, 2.0 Å, 1.5 Å, 1.0 
Å, 0.8 Å, 0.6 Å, 0.4 Å, or about 0.2 Å or better. 

An advantage of the methods of the invention is 
that a structure of a polypeptide bound to a bound 
conformation of a ligand need not be determined to identify 
a pharmacocluster. Thus, methods that detect only the 
structure of the ligand can be used in the invention. In 
some cases determination or refinement of only the structure 
of the ligand in a polypeptide-ligand complex will be 
required. In addition, methods that detect a 
conformation-dependent property of the ligand can be used to 
identify a pharmacocluster. Methods that can be used to 
determine a conformation-dependent property of a ligand in a 
polypeptide-ligand complex without determining the structure 
of the polypeptide include, for example, Electron Nuclear 
Double Resonance spectroscopy (ENDOR, as described in Van 
Doorslaer and Schweiger, Naturwissenschaften 87:245-55 
(2000)), Electron Paramagnetic Resonance spectroscopy 
(EPR, described in Cantor and Schimmel, Biophysical 
Chemistry, Part I: The Conformation of Biological 
Macromolecules, W. H. Freeman and Company (1980)), chemically 
induced dynamic nuclear polarization (CIDNP, described in 
Siebert et al., Glycoconj. J. 14:945-9 (1997) and Consonni et 
al., FEBS Lett. 372:135-9 (1995)), solid state NMR 
(described in Mehring, M., High Resolution NMR Spectroscopy 
in Solids, 2nd ed., Springer-Verlag, Berlin (1983)) and liquid 
phase NMR (described in Wüthrich, NMR of Proteins and 
Nucleic Acids, John Wiley & Sons, Inc. (1986)). Thus, the 
invention can be performed in a manner whereby the time and 
cost associated with a full determination of a polypeptide 
structure are avoided. 

Any representation that correlates with the 
structure of a bound conformation of a ligand can be used in 
the methods of the invention. For example, a convenient and 
commonly used representation is a displayed image of the 
structure. Displayed images that are particularly useful 
for determining the bound conformation of a ligand bound to 
polypeptides include, for example, ball and stick models, 
density maps, space filling models, surface maps, Connolly 
surfaces, van der Waals surfaces or CPK models. Display of 
images as a computer output, for example, on a video screen 
can be advantageous as described below. 

Clustering can be performed with any ligand or any 
number of bound conformations of a ligand. The methods of 
the invention can be performed by clustering 2 or more bound 
conformations of a ligand. For example, clustering can be 
performed with 3 or more, 4 or more, 5 or more, 6 or more, 7 
or more, 8 or more, 9 or more, 10 or more, 11 or more, 12 or 
more, 13 or more, 14 or more, 15 or more or 20 or more bound 
conformations of a ligand. The methods of the invention can 
be used with any number of bound conformations of a ligand. 
Due to the large sizes of data sets required to represent 
bound conformations of a ligand, methods of clustering bound 
conformations are generally performed on a computer. The 
methods are compatible with any computer that can support 
molecular modeling software including, for example, a personal 
computer, Silicon Graphics workstation, or supercomputer. A 
variety of computer software programs are available for 
molecular modeling including, for example, GRASP (Nicholls, 
A., supra), ALADDIN (Van Drie et al., supra), INSIGHT98 
(Molecular Simulations Inc., San Diego CA), RASMOL (Sayle et 
al., Trends Biochem. Sci. 20:374-376 (1995)) and MOLMOL 
(Koradi et al., J. Mol. Graphics 14:51-55 (1996)). 



Once a bound conformation of a ligand bound to 
different polypeptides has been determined, two or more 
bound conformations of the ligand can be compared and those 
having substantially the same bound conformation can be 
clustered. Methods of comparison include, for example, a 
method that provides alignment of two or more bound 
conformations of a ligand and evaluation of the degree of 
overlap in the two structures. Methods of comparison can be 
performed in an iterative fashion until a best fit is 
identified. 

Methods of comparing bound conformations of bound 
ligands include, for example, cluster analysis, visual 
inspection and pairwise structural comparisons. Cluster 
analysis is commonly performed by, but not limited to, 
partitioning methods or hierarchical methods as described, 
for example, in Kaufman and Rousseeuw, Finding Groups in 
Data: An Introduction to Cluster Analysis, John Wiley and 
Sons Inc., New York (1990). Partitioning methods that can 
be used include, for example, partitioning around medoids, 
clustering large applications, and fuzzy analysis, as 
described in Kaufman and Rousseeuw, supra. Hierarchical 
methods useful in the invention include, for example, 
agglomerative nesting, divisive analysis, and monothetic 
analysis, as described in Kaufman and Rousseeuw, supra. 
Algorithms for cluster analysis of molecular structures are 
known in the art and include, for example, COMPARE (Chiron 
Corp, 1995; distributed by Quantum Chemistry Program 
Exchange, Indianapolis IN). COMPARE can be used to make all 
possible pairwise comparisons between a set of conformations 
of the same ligand(s). COMPARE reads PDB files and uses a 
Ferro-Hermanns ORIENT algorithm for a least squares root 
mean square (RMS) fit. The structures can be clustered into 
groups using the Jarvis-Patrick nearest neighbors algorithm. 
Based on the RMS deviation between ligand conformers, a list 
of 'nearest neighbors' for each conformer is generated. 
Two conformers are then grouped together or clustered if 
(1) the RMS deviation is sufficiently small and (2) both 
conformers share a determined number of common 'neighbors'. 
Both criteria are adjusted by the program to generate 
clusters based on a user defined cutoff for distance between 
individual clusters. Follow up analysis was conducted using 
InsightII to verify clusters. A member conformation is 
identified as being closer to the averaged coordinates of 
conformations within its family than to the averaged 
coordinates of any other family. 
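
As a minimal sketch of the two grouping criteria described 
above, and not a reproduction of the COMPARE program, the 
following Python function clusters conformers from a 
precomputed matrix of pairwise RMS deviations. The function 
name jarvis_patrick and the parameters k, min_shared and 
cutoff are assumptions chosen for illustration; the values 
used by any particular program may differ. 

```python
def jarvis_patrick(dist, k, min_shared, cutoff):
    """Group conformers using nearest-neighbor lists.

    dist       -- square matrix (list of lists) of pairwise RMS deviations
    k          -- length of each conformer's nearest-neighbor list
    min_shared -- number of common neighbors two conformers must share
    cutoff     -- largest RMS deviation regarded as sufficiently small
    Returns a sorted list of clusters, each a list of conformer indices.
    """
    n = len(dist)

    # Criterion input: the k nearest neighbors of every conformer.
    neighbors = []
    for i in range(n):
        others = sorted((j for j in range(n) if j != i),
                        key=lambda j: dist[i][j])
        neighbors.append(set(others[:k]))

    # Union-find structure so that pairwise links merge into clusters.
    parent = list(range(n))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path compression
            i = parent[i]
        return i

    for i in range(n):
        for j in range(i + 1, n):
            close_enough = dist[i][j] <= cutoff
            shared = len(neighbors[i] & neighbors[j])
            if close_enough and shared >= min_shared:
                parent[find(i)] = find(j)

    clusters = {}
    for i in range(n):
        clusters.setdefault(find(i), []).append(i)
    return sorted(clusters.values())
```

Linking pairs through a union-find structure means that a 
conformer joins a cluster as soon as it satisfies both 
criteria with any one member, which is one of several 
reasonable design choices for this kind of algorithm. 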



Using methods such as those described above, one 
skilled in the art will know how to identify conformations 
that are substantially the same. For example, similarity 
can be evaluated according to the goodness of fit between 
two or more bound conformations of a ligand. Goodness of 
fit can be represented by a variety of parameters known in 
the art including, for example, the root mean square 
deviation (RMSD). A lower RMSD between structures 
correlates with a better fit compared to a higher RMSD 
between structures. Bound conformations of a ligand having 
substantially the same conformations can be identified by 
comparing mean RMSD values within and between 
pharmacoclusters. Accordingly, bound conformations of a 
ligand having substantially the same conformations can have 
a mean RMSD compared to an average structure for the 
pharmacocluster that is less than 1.1 Å. Two or more bound 
conformations of a ligand can be clustered by assigning 
bound conformations of a ligand into a collection such that 
the conformations of a ligand residing in the collection are 
substantially the same. Members of a pharmacocluster can 
also be identified as having RMSD values compared to an 
average structure for the pharmacocluster that are less than 
1.0 Å, 0.9 Å, 0.8 Å, 0.7 Å, 0.6 Å, 0.5 Å, 0.4 Å, 0.3 Å, 0.2 
Å or 0.1 Å. 
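
The comparison of each member to an average structure can be 
sketched as follows. This Python example is illustrative 
only: it assumes that the conformations have already been 
superimposed and list the same atoms in the same order, and 
the function names rmsd, average_structure and 
rmsd_to_average are hypothetical. 

```python
import math


def rmsd(conf_a, conf_b):
    """RMSD between two equally ordered lists of (x, y, z) positions."""
    if len(conf_a) != len(conf_b) or not conf_a:
        raise ValueError("conformations must list the same atoms")
    total = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(conf_a, conf_b):
        total += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(total / len(conf_a))


def average_structure(conformations):
    """Average each atom position over all conformations in a cluster."""
    count = len(conformations)
    n_atoms = len(conformations[0])
    return [
        tuple(sum(conf[i][axis] for conf in conformations) / count
              for axis in range(3))
        for i in range(n_atoms)
    ]


def rmsd_to_average(conformations):
    """Return (RMSD of each member to the average structure, mean RMSD)."""
    avg = average_structure(conformations)
    values = [rmsd(conf, avg) for conf in conformations]
    return values, sum(values) / len(values)
```

A mean value returned by rmsd_to_average can then be compared 
with a chosen threshold, such as the 1.1 Å value recited 
above. 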

A bound conformation of a ligand that is a member 
of a pharmacocluster can also be identified by comparing the 
RMSD for the bound conformation to an average conformation 
of the members in multiple pharmacoclusters. Using this 
value for comparison, a member conformation is identified as 
having a smaller RMSD when compared to the averaged 
coordinates of conformations within its family than when 
compared to the averaged coordinates of any other family. 
In addition, a member of a pharmacocluster can be identified 
as having an RMSD compared to an average conformation of the 
members in a pharmacocluster that is smaller than the RMSD 
between each family's average coordinates. For example, as 
described in Example I, RMSD values for members of 
pharmacoclusters 1-8 as presented in Tables 3A, 4A, 5A, 6A, 
7A, 8A, 9A or 10A, respectively, can be compared to RMSD 
values between each pharmacocluster as presented in Table 2. 
Comparisons similar to those described above can be made for 
bound conformations of any ligand according to the methods 
described in the Examples. 
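
The membership test described above, in which a conformation 
is assigned to the family whose averaged coordinates give the 
smallest RMSD, can be sketched in Python as follows. The 
function name assign_pharmacocluster and the cluster labels 
are hypothetical, and the sketch assumes that the conformation 
and the averaged coordinates are already superimposed and 
list the same atoms in the same order. 

```python
import math


def rmsd(conf_a, conf_b):
    """RMSD between two equally ordered lists of (x, y, z) positions."""
    if len(conf_a) != len(conf_b) or not conf_a:
        raise ValueError("conformations must list the same atoms")
    total = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(conf_a, conf_b):
        total += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(total / len(conf_a))


def assign_pharmacocluster(conformation, cluster_averages):
    """Return (label, rmsd) for the closest averaged cluster coordinates.

    cluster_averages maps a cluster label to its averaged coordinates.
    """
    scores = {label: rmsd(conformation, avg)
              for label, avg in cluster_averages.items()}
    best = min(scores, key=scores.get)
    return best, scores[best]


if __name__ == "__main__":
    averages = {
        "cluster_1": [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)],
        "cluster_2": [(0.0, 0.0, 0.0), (0.0, 1.0, 0.0)],
    }
    print(assign_pharmacocluster([(0.0, 0.0, 0.0), (0.9, 0.1, 0.0)], averages))
```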



In addition, bound conformations of a ligand can 
be compared with respect to dihedral angles at particular 
bonds. Exemplary methods for comparing dihedral angles 
between pharmacoclusters are described in Example I and Table 
1. Comparison between dihedral angles can be used, for 
example, in combination with overall RMSD comparisons such 
as those described above. Therefore, bound conformations 
that are not easily distinguished by comparison of overall 
RMSD alone can be distinguished according to the combined 
comparison of RMSD and dihedral angle. Bound conformations 
of a ligand that are members of different pharmacoclusters 
can have dihedral angles that differ, for example, by at 
least about 10 degrees, 30 degrees, 45 degrees, 90 degrees 
or 180 degrees. 
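
For illustration, a dihedral angle about a bond can be 
computed from the coordinates of four consecutive atoms, and 
two such angles can be compared on a circular scale, as in the 
following Python sketch. The function names dihedral_degrees 
and angle_difference are assumptions introduced here, and the 
sign of the returned angle depends on the convention chosen. 

```python
import math


def _sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])


def _dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]


def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def dihedral_degrees(p0, p1, p2, p3):
    """Dihedral angle about the p1-p2 bond, in degrees (-180 to 180).

    A cis arrangement of p0 and p3 gives 0; a trans arrangement gives
    a magnitude of 180.
    """
    b1 = _sub(p1, p0)
    b2 = _sub(p2, p1)
    b3 = _sub(p3, p2)
    n1 = _cross(b1, b2)  # normal to the plane of p0, p1, p2
    n2 = _cross(b2, b3)  # normal to the plane of p1, p2, p3
    y = math.sqrt(_dot(b2, b2)) * _dot(b1, n2)
    x = _dot(n1, n2)
    return math.degrees(math.atan2(y, x))


def angle_difference(a, b):
    """Smallest difference between two angles in degrees (0 to 180)."""
    diff = abs(a - b) % 360.0
    return min(diff, 360.0 - diff)
```

The circular difference avoids treating, for example, 170 
degrees and -170 degrees as 340 degrees apart when they differ 
by only 20 degrees. 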

The invention also provides a pharmacocluster 
selected from the cluster consisting of pharmacocluster 1, 
pharmacocluster 2, pharmacocluster 3, pharmacocluster 4, 
pharmacocluster 5, pharmacocluster 6, pharmacocluster 7, and 
pharmacocluster 8 correlated with the pharmacofamilies 
listed in Table 11. 



Pharmacoclusters 1 through 8 contain bound 
conformations of NAD(P)(H) determined from structures 
deposited in the PDB for NAD(P)(H) bound to oxidoreductase 
polypeptides. Pharmacoclusters are shown in Figure 1 and 
described in further detail in Example I. The 
pharmacoclusters of Figure 1 display substantial overlap 
between bound conformations of NAD(P)(H) within the cluster, 
as can be identified by visual inspection of the structures. 
Quantitative comparison of the bound conformations in each 
pharmacocluster demonstrates that each pharmacocluster 
displays less than about 1.1 Å difference in RMSD between 
each conformation of NAD(P)(H) and the average bound 
conformation for the respective pharmacocluster as described 
in Example I. 

Pharmacoclusters can be used to identify a ligand 
having specificity for one or more polypeptide 
pharmacofamilies (see Example V). As described herein, a 
pharmacophore model or conformer model can be derived from 
one or more clusters. These models can be used to identify a 
ligand having specificity for one or more pharmacofamilies 
of oxidoreductases, for example, by using the model to query 
a database of molecules for a potential ligand or by using 
the model to guide in the design of a synthetic ligand. An 
example of using a pharmacophore of the invention to 
identify a binding compound is provided in Example VI. 

Pharmacoclusters, including, for example, 
pharmacoclusters 1 through 8, can also be used to identify a 
new polypeptide member of a polypeptide pharmacofamily. 
Using the methods described herein, for example, a 
pharmacocluster can be used to produce a pharmacophore model 
or conformer model to which a bound conformation of a ligand 
can be compared. A polypeptide bound to a bound 
conformation of a ligand that is similar to the model can be 
classified into an appropriate polypeptide pharmacofamily 
based on this comparison. By a similar method, a bound 
conformation of a ligand can be directly compared to a 
pharmacocluster to classify the polypeptide bound to the 
conformation of a ligand into an appropriate pharmacofamily. 



The methods of the invention can also be used with 
a portion of a bound conformation of a ligand to identify a 
pharmacocluster. The method consists of (a) determining a 
bound conformation of a ligand, or portion thereof, bound to 
two or more polypeptides, and (b) clustering two or more 
bound conformations of the ligand, or portion thereof, having 
substantially the same bound conformation, thereby 
identifying a pharmacocluster. 

A bound conformation of a portion of a ligand can 
include selected atoms and/or bonds of a ligand and can 
include, for example, a continuous sequence of atoms and/or 
bonds or a discontinuous sequence of selected atoms and/or 
bonds that, when described independent of the complete 
ligand structure, may not appear to be attached to each 
other. Such a portion can include 2 or more atoms of a 
bound conformation of a ligand or 3 or more, 4 or more, 5 or 
more, 6 or more, 7 or more, 8 or more, 9 or more, 10 or 
more, 15 or more, 20 or more, 25 or more or 50 or more atoms 
of a bound conformation of a ligand. A bound conformation 
of a portion of a ligand bound to a polypeptide can be 
identified according to the same methods described above for 
identifying a bound conformation of a ligand bound to a 
polypeptide. Two or more bound conformations of a portion 
of a ligand can be clustered as described above so long as 
the bound conformations that are clustered correspond to 
bound portions of the ligand having the same structural 
formula. For example, in a case where determination of the 
complete structure of a ligand has not been achieved, a 
bound conformation of a portion of the ligand corresponding 
to the structurally determined portion can be used in the 
methods of the invention. 

A pharmacocluster can include portions of bound 
conformations derived from different ligands so long as the 
portions have a core bound conformation that is 
substantially the same. For example, portions having the 
same structural formula and bond configuration can share a 
core bound conformation. The bond configuration describes 
the relative position of atoms attached to a chiral atom of 
a ligand. Accordingly, R and S stereoisomers of a chiral 
atom have different bond configurations. Other terms used 
in the art to designate different bond configurations 
include, for example, cis and trans configurations of atoms 
attached to carbons that are double bonded, or Z and E 
configurations of atoms attached to carbons that are double 
bonded. An example of portions of ligands having the same 
structural formula and bond configuration that can share a 
core bound conformation are the nicotinamide adenine 
dinucleotide portions of nicotinamide adenine dinucleotide 
phosphate (NADP) and nicotinamide adenine dinucleotide 
(NAD). Additionally, portions of ligands having different 
charge, atom substitution or bond hybridization can share a 
core bound conformation. An example of portions of ligands 
having different charge and bond hybridization that can 
share a core bound conformation are the nicotinamide adenine 
dinucleotide portions of oxidized nicotinamide adenine 
dinucleotide (NAD) and reduced nicotinamide adenine 
dinucleotide (NADH). In cases where the core structures of 
two ligands bind with substantially the same conformation to 
polypeptides, the core bound conformations can be clustered 
according to the methods of the invention (see Example I). 



Substantially the same bound conformation of a 
portion of a bound conformation of a ligand, including 
non-continuous atoms, can be identified according to the root 
mean square deviation and compared directly. Conformations 
of portions having different numbers of atoms can also be 
compared via root mean square deviation per equivalent atom 
(RMSD/N, where N is the number of atoms compared). A lower 
value of RMSD/N indicates increased similarity between the 
two or more bound ligand conformations that are clustered. 
One skilled in the art will know that RMSD/N has a 
compensational origin and consideration of the effect of N 
is required for comparison of RMSD/N between 
pharmacoclusters having different values of N. For example, 
the lower the value of RMSD/N the lower should be the value 
of N to indicate substantial similarity. 
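
The RMSD per equivalent atom can be sketched in Python as 
follows for a selected, possibly non-continuous, set of atoms. 
The function name rmsd_over_atoms and the atom labels in the 
demo are hypothetical, and the sketch assumes that the two 
conformations have already been superimposed. 

```python
import math


def rmsd_over_atoms(conf_a, conf_b, atom_names):
    """Return (RMSD, RMSD/N) over the selected atoms.

    conf_a, conf_b -- dicts mapping an atom name to its (x, y, z) position
    atom_names     -- the N atoms compared; they need not be bonded to
                      each other
    """
    n = len(atom_names)
    if n == 0:
        raise ValueError("at least one atom must be compared")
    total = 0.0
    for name in atom_names:
        xa, ya, za = conf_a[name]
        xb, yb, zb = conf_b[name]
        total += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    value = math.sqrt(total / n)
    return value, value / n


if __name__ == "__main__":
    a = {"N1": (0.0, 0.0, 0.0), "C2": (1.0, 0.0, 0.0), "O7": (5.0, 5.0, 5.0)}
    b = {"N1": (0.0, 1.0, 0.0), "C2": (1.0, 1.0, 0.0), "O7": (9.0, 9.0, 9.0)}
    print(rmsd_over_atoms(a, b, ["N1", "C2"]))
```

Because the second returned value is divided by N, it should 
be interpreted together with N when portions of different 
sizes are compared, as noted above. 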



The invention can be used with any ligand for 
which bound conformations of the ligand bound to different 
polypeptides can be determined including, for example, 
chemical or biological molecules such as simple or complex 
organic molecules, metal-containing compounds, 
carbohydrates, peptides, peptidomimetics, 
lipids, nucleic acids, and the like. 

In one embodiment, the compositions and methods of 
the invention can be used with a ligand that is a nucleotide 
derivative including, for example, a nicotinamide adenine 
dinucleotide-related molecule. Nicotinamide adenine 
dinucleotide-related (NAD-related) molecules that can be 
used in the methods of the invention can be selected from 
the group consisting of oxidized nicotinamide adenine 
dinucleotide (NAD+), reduced nicotinamide adenine 
dinucleotide (NADH), oxidized nicotinamide adenine 
dinucleotide phosphate (NADP+), and reduced nicotinamide 
adenine dinucleotide phosphate (NADPH). An NAD-related 
molecule can also be a mimetic of the above-described 
molecules. Use of an NAD-related molecule to identify 
pharmacoclusters is described in Example I. 

A mimetic is a molecule that has at least one 
function that is substantially the same as a function of a 
second molecule. A mimetic of a ligand can be identified 
according to its ability to bind to the same sites on a 
polypeptide as the ligand. For example, a mimetic can be 
identified by a binding competition assay using a ligand and 
a mimetic. The structure of a mimetic can be similar or 
different compared to the structure of the second molecule. 
The term can encompass molecules having portions similar to 
corresponding portions of the ligand in terms of structure 
or function. 

Examples of mimetics to the common ligand NADH, 
for example cibacron blue, are described in Dye-Ligand 
Chromatography, Amicon Corp., Lexington MA (1980). Numerous 
other examples of NADH-mimics, including useful 
modifications to obtain such mimics, are described in Everse 
et al. (eds.), The Pyridine Nucleotide Coenzymes, Academic 
Press, New York NY (1982). Particular analogs include 
nicotinamide 2-aminopurine dinucleotide, nicotinamide 
8-azidoadenine dinucleotide, nicotinamide 1-deazapurine 
dinucleotide, 3-aminopyridine adenine dinucleotide, 
3-acetylpyridine adenine dinucleotide, thiazole amide adenine 
dinucleotide, 3-diazoacetylpyridine adenine dinucleotide and 
5-aminonicotinamide adenine dinucleotide. Particular 
mimetics can be identified and selected by 
ligand-displacement assays, for example using competitive 
binding assays with a known ligand as is well known in the art. 
Mimetic candidates can also be identified by searching 
databases of compounds for structural similarity with the 
common ligand or a mimetic. 

In another embodiment, the methods of the 
invention can be used with a ligand that is an adenosine 
phosphate-related molecule. Adenosine phosphate-related 
molecules can be selected from the group consisting of 
adenosine triphosphate (ATP), adenosine diphosphate (ADP), 
adenosine monophosphate (AMP), and cyclic adenosine 
monophosphate (cAMP). An adenosine phosphate-related 
molecule can also be a mimetic of the above-described 
molecules. A mimetic of an adenosine phosphate-related 
molecule that can be used in the invention includes, for 
example, quercetin, adenylylimidodiphosphate (AMP-PNP) or 
olomoucine. 

A ligand useful in the methods of the invention 
can be a cofactor, coenzyme or vitamin including, for 
example, NAD, NADP, or ATP as described above. Other 
examples include thiamine (vitamin B1), riboflavin (vitamin 
B2), pyridoxamine (vitamin B6), cobalamin (vitamin B12), 
pyrophosphate, flavin adenine dinucleotide (FAD), flavin 
mononucleotide (FMN), pyridoxal phosphate, coenzyme A, 
ascorbate (vitamin C), niacin, biotin, heme, porphyrin, 
folate, tetrahydrofolate, a nucleotide such as guanosine 
triphosphate, cytidine triphosphate, thymidine triphosphate, 
uridine triphosphate, retinol (vitamin A), calciferol 
(vitamin D2), ubiquinone, ubiquitin, α-tocopherol (vitamin 
E), farnesyl, geranylgeranyl, pterin, pteridine or 
S-adenosyl methionine (SAM). 

A polypeptide can be used as a ligand in the 
invention. For example, a ligand can be a naturally 
occurring polypeptide ligand such as a ubiquitin or 
polypeptide hormone including, for example, insulin, human 
growth hormone, thyrotropin releasing hormone, 
adrenocorticotropic hormone, parathyroid hormone, follicle 
stimulating hormone, thyroid stimulating hormone, 
luteinizing hormone, human chorionic gonadotropin, epidermal 
growth factor, nerve growth factor and the like. In 
addition, a polypeptide ligand can be a non-naturally 
occurring polypeptide that has binding activity. Such 
polypeptide ligands can be identified, for example, by 
screening a synthetic polypeptide library such as a phage 
display library or combinatorial polypeptide library as 
described below. A polypeptide ligand can also contain 
amino acid analogs or derivatives such as those described 
below. Methods of isolation of a polypeptide ligand are 
well known in the art and are described, for example, in 
Scopes, Protein Purification: Principles and Practice, 3rd 
Ed., Springer-Verlag, New York (1994); Deutscher, Methods in 
Enzymology, Vol. 182, Academic Press, San Diego (1990); and 
Coligan et al., Current Protocols in Protein Science, John 
Wiley and Sons, Baltimore, MD (2000). 

A nucleic acid can also be used as a ligand in the 
invention. Examples of nucleic acid ligands useful in the 
invention include DNA, such as genomic DNA or cDNA, or RNA 
such as mRNA, ribosomal RNA or tRNA. A nucleic acid ligand 
can also be a synthetic oligonucleotide. Such ligands can 
be identified by screening a random oligonucleotide library 
for ligand binding activity, for example, as described 
below. Nucleic acid ligands can also be isolated from a 
natural source or produced in a recombinant system using 
well known methods in the art including, for example, those 
described in Sambrook et al., Molecular Cloning: A 
Laboratory Manual, 2nd ed., Cold Spring Harbor Press, 
Plainview, New York (1989); Ausubel et al., Current 
Protocols in Molecular Biology (Supplement 47), John Wiley & 
Sons, New York (1999). 

A ligand used in the invention can be an amino 
acid, amino acid analog or derivatized amino acid. An amino 
acid ligand can be one of the 20 essential amino acids or 
any other amino acid isolated from a natural source. Amino 
acid analogs useful in the invention include, for example, 
neurotransmitters such as gamma amino butyric acid, 
serotonin, dopamine, or norepinephrine or hormones such as 
thyroxine, epinephrine or melatonin. A synthetic amino 
acid, or analog thereof, can also be used in the invention. 
A synthetic amino acid can include chemical modifications of 
an amino acid such as alkylation, acylation, carbamylation, 
iodination, or any modification that derivatizes the amino 
acid. Such derivatized molecules include, for example, 
those molecules in which free amino groups have been 
derivatized to form amine hydrochlorides, p-toluene sulfonyl 
groups, carbobenzoxy groups, t-butyloxycarbonyl groups, 
chloroacetyl groups or formyl groups. Free carboxyl groups 
can be derivatized to form salts, methyl and ethyl esters or 
other types of esters or hydrazides. Free hydroxyl groups 
can be derivatized to form O-acyl or O-alkyl derivatives. 
The imidazole nitrogen of histidine can be derivatized to 
form N-im-benzylhistidine. Naturally occurring amino acid 
derivatives of the twenty standard amino acids can also be 
included in a cluster of bound conformations including, for 
example, 4-hydroxyproline, 5-hydroxylysine, 
3-methylhistidine, homoserine, ornithine or 
carboxyglutamate. 

A lipid ligand can also be used in the invention. 
Examples of lipid ligands include triglycerides, 
phospholipids, glycolipids or steroids. Steroids useful in 
the invention include, for example, glucocorticoids, 
mineralocorticoids, androgens, estrogens or progestins. 



Another type of ligand that can be used in the 
invention is a carbohydrate. A carbohydrate ligand can be a 
monosaccharide such as glucose, fructose, ribose, 
glyceraldehyde, or erythrose; a disaccharide such as 
lactose, sucrose, or maltose; an oligosaccharide such as those 
recognized by lectins such as agglutinin, peanut lectin or 
phytohemagglutinin; or a polysaccharide such as cellulose, 
chitin, or glycogen. 

Methods for producing pluralities of compounds to 
use as ligands, including chemical or biological molecules 
such as simple or complex organic molecules, 
metal-containing compounds, carbohydrates, peptides, 
peptidomimetics, lipids, nucleic acids, and 
the like, are well known in the art (see, for example, 
Huse, U.S. Patent No. 5,264,563; Francis et al., Curr. Opin. 
Chem. Biol. 2:422-428 (1998); Tietze et al., Curr. Biol. 
2:363-371 (1998); Sofia, Mol. Divers. 3:75-94 (1998); 
Eichler et al., Med. Res. Rev. 15:481-496 (1995); Gordon et 
al., J. Med. Chem. 37:1233-1251 (1994); Gordon et al., J. 
Med. Chem. 37:1385-1401 (1994); Gordon et al., Acc. Chem. 
Res. 29:144-154 (1996); Wilson and Czarnik, eds., 
Combinatorial Chemistry: Synthesis and Application, John 
Wiley & Sons, New York (1997); Gold et al., U.S. Pat. Nos. 
5,475,096 (1995), 5,789,157 (1998), and 5,270,163 (1993)). 
The advantage of using such a combinatorial library is that 
molecules do not have to be individually generated to 
identify a ligand that binds a polypeptide. Also, no prior 
knowledge of the exact characteristics of a binding 
polypeptide is required when using a combinatorial library. 
Libraries containing large numbers of natural and synthetic 
compounds also can be individually synthesized or obtained 
from commercial sources. 

In addition, the invention provides a method for
identifying a conformation-dependent property of a ligand.
The method includes the steps of (a) determining bound
conformations of a ligand bound to different polypeptides;
(b) identifying two or more bound conformations of the
ligand having substantially the same bound conformation, and
(c) identifying a conformation-dependent property of the
bound conformations of the ligand having substantially the
same bound conformation, the conformation-dependent property
being correlated with the bound conformation of the ligand.

A conformation-dependent property can be
identified as any property that correlates with a bound
conformation of a ligand such that a change in the bound
conformation results in a change in the
conformation-dependent property. Accordingly, a bound
conformation of a ligand, or a portion thereof, can be a
conformation-dependent property. A portion of a bound
conformation of a ligand can be a contiguous fragment or a
non-contiguous set of atoms or bonds. A bound conformation
of a ligand, or portion thereof, can be identified by any
method for determining the three dimensional structure of a
ligand, including as disclosed herein.

Other conformation-dependent properties include,
for example, absorption and emission of heat, absorption and
emission of electromagnetic radiation, rotation of polarized
light, magnetic moment, spin state of electrons, or
polarity, as disclosed herein, or other properties that can
be identified as a spectroscopic signal. Methods known in
the art for measuring changes in absorption and emission of
heat that correlate with changes in bound conformation of a
ligand include, for example, calorimetry. Methods known in
the art for measuring changes in absorption and emission of
electromagnetic radiation as they correlate with changes in
bound conformation of a ligand include, for example, UV/VIS
spectroscopy, fluorimetry, luminometry, infrared
spectroscopy, Raman spectroscopy, resonance Raman
spectroscopy, X-ray absorption fine structure spectroscopy
(XAFS) and the like. A change in a bound conformation of a
ligand that is correlated with a change in rotation of
polarized light can be measured with circular dichroism
spectroscopy or optical rotation spectroscopy. A change in
magnetic moment or spin state of an electron that correlates
with a change in a bound conformation can be measured, for
example, with electron paramagnetic resonance spectroscopy
(EPR) or nuclear magnetic resonance spectroscopy (NMR).

When based on NMR data, a conformation-dependent
property can be identified as an NMR signal including, for
example, chemical shift, J coupling, dipolar coupling,
cross-correlation, nuclear spin relaxation, transferred
nuclear Overhauser effect, and any combination thereof. A
conformation-dependent property can be identified by NMR
methods in both fast and slow exchange regimes. For
example, in many cases, the exchange rate of a complex
between ligand and polypeptide is faster than the ligand
spin relaxation rate (1/T1H). In this situation, referred
to as the "fast exchange regime," transferred nuclear
Overhauser effect (NOE) experiments can be performed to
measure an intra-ligand proton-proton distance (Wuthrich,
NMR of Proteins and Nucleic Acids, Wiley, New York (1986)
and Gronenborn, J. Magn. Res. 53:423-442 (1983)). Labeling
of polypeptides is not required, and the ligand-to-polypeptide
concentration ratio can be adjusted to minimize line
broadening of the ligand resonances while retaining strong
NOE contribution from the bound form.

In a fast exchange regime, cross-correlated
relaxation measurements can also provide structural
information on ligand torsion angles (Carlomagno et al., J.
Am. Chem. Soc. 121:1945-1948 (1999)). These measurements
include the ¹H-¹H dipole-dipole cross-correlation but can be
extended to other cross-correlated relaxation mechanisms
involving also homo- and heteronuclear chemical shielding
anisotropy relaxation, as well as quadrupolar relaxation.
For most of these heteronuclear experiments, the natural
abundance of the isotope can be exploited. In cases where
natural abundance of the isotope measured is not sufficient,
isotope enriched ligands can be obtained from commercial
sources such as Isotek (Miamisburg, OH) or Cambridge Isotope
Laboratories (Andover, MA) or prepared by methods known in
the art. Another method to determine a
conformation-dependent property of a ligand in a fast
exchange regime is use of residual homo- and heteronuclear
dipolar couplings in partially aligned samples (Tolman et
al., Proc. Natl. Acad. Sci. USA 92:9279-9283 (1995)).



In the slow exchange regime, the NMR signals
arising from the bound conformation of the ligand are
distinguished from those of the polypeptide to reduce
resonance overlap. This can be achieved with different
isotope labeling schemes of polypeptide, ligand or both.
For large systems, perdeuteration of macromolecules and
TROSY-type experiments (Pervushkin, Proc. Natl. Acad. Sci.
USA 94:12366-12371 (1997)) can be used to minimize signal
losses due to fast transverse relaxation of the resonances
of the complex. With the appropriate sample requirements
and isotope filtered experiments, cross-correlations,
cross-relaxations and residual dipolar couplings can be
measured and provide necessary structural information.

In addition, homo- and heteronuclear two and three
bond J couplings can be obtained to provide information on
torsion angles (Wuthrich, supra). For example, as shown in
Table 1 the bound conformations of NADP in pharmacocluster 4
and pharmacocluster 5 differ by a torsion angle defined by
the atoms PN-O5'N-C5'N-C4'N (see Figure 2 for atom labeling
and bond location). Specifically, pharmacocluster 4 has a
PN-O5'N-C5'N-C4'N torsion angle of 145 degrees and
pharmacocluster 5 has a PN-O5'N-C5'N-C4'N angle of -112
degrees. These torsion angles can be measured and
distinguished by measuring the three bond ³¹P-¹³C4' J
coupling constants that correspond to this torsion angle
(Marino, Acc. Chem. Res. 32:614-623 (1999)). Basically, two
¹H-¹³C correlation experiments can be performed with and
without ³¹P decoupling during ¹³C evolution. The intensity
ratio of the ¹H4'/¹³C4' cross peak from each experiment is
proportional to the ³¹P-¹³C4' J coupling constant.
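
As a minimal sketch of how a torsion angle of the kind
described above can be computed from the atomic coordinates
of a bound conformation, the following Python example applies
the standard four-point dihedral formula. The function names
torsion_angle and torsion_difference and the toy coordinates
are illustrative assumptions and are not part of the
disclosed methods; only the angles of 145 and -112 degrees
are taken from the text above.

```python
import math


def torsion_angle(p1, p2, p3, p4):
    """Signed torsion angle in degrees for four points given as (x, y, z)."""
    def sub(a, b):
        return tuple(x - y for x, y in zip(a, b))

    def dot(a, b):
        return sum(x * y for x, y in zip(a, b))

    def cross(a, b):
        return (a[1] * b[2] - a[2] * b[1],
                a[2] * b[0] - a[0] * b[2],
                a[0] * b[1] - a[1] * b[0])

    # Bond vectors along the chain p1 -> p2 -> p3 -> p4.
    b1, b2, b3 = sub(p2, p1), sub(p3, p2), sub(p4, p3)
    # Normals to the planes (p1, p2, p3) and (p2, p3, p4).
    n1, n2 = cross(b1, b2), cross(b2, b3)
    y = dot(b2, cross(n1, n2))
    x = math.sqrt(dot(b2, b2)) * dot(n1, n2)
    return math.degrees(math.atan2(y, x))


def torsion_difference(a_deg, b_deg):
    """Smallest absolute difference between two torsion angles, in degrees."""
    return abs((a_deg - b_deg + 180.0) % 360.0 - 180.0)


# Toy coordinates only (hypothetical): four points at a right-angle twist.
print(torsion_angle((1, 0, 0), (0, 0, 0), (0, 0, 1), (0, 1, 1)))
# Torsion values quoted in the text for pharmacoclusters 4 and 5.
print(torsion_difference(145.0, -112.0))
```

In use, the four points would be the coordinates of the
PN, O5'N, C5'N and C4'N atoms of a bound conformation, and
the wrapped difference can indicate whether two bound
conformations differ at that torsion.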

Correlation of a conformation-dependent property
with a bound conformation of a ligand can be achieved by any
method that has sufficient sensitivity to detect changes
that correlate with changes in bound conformation of a
ligand. Such a correlation can be determined by measuring a
conformation-dependent property for various conformations of
a ligand and determining the extent of change in the signal
with change in the conformation. Signal changes that
correlate with changes in conformation and that are
detectable with a signal to noise ratio accepted in the art
as significant can be used in the invention.

Correlation between a conformation-dependent
property and a conformation can be determined for a ligand
bound to any partner so long as binding is specific and
stable. For example, for purposes of establishing a
correlation, changes in a conformation-dependent property
that correlate with changes in bound conformation of a
ligand can be determined for a ligand bound to polypeptides
from different polypeptide pharmacofamilies. A bound
conformation of the ligand in each complex can be determined
and a conformation-dependent property can be measured for
each complex. Comparison of bound conformations of the
ligand in each complex with a measured
conformation-dependent property can be used to establish a
correlation. Demonstration of a method for establishing a
correlation between an NMR signal and bound conformations of
a ligand is described herein (see Example IV). Other
methods for correlating spectroscopic signals with bound
conformations of a ligand are known in the art including,
for example, correlation of transferred NOE signals with
anti and syn conformations of the nicotinamide ring in NADPH
as described in Sem and Kasper, Biochemistry 31:3391-3398
(1992). Correlation of transferred NOE signals with
conformation is also described in Clore and Gronenborn, J.
Magn. Reson. 48:402-417 (1982).



A correlation between a bound conformation and a
conformation-dependent property can also be established for
a ligand bound to a non-polypeptide binding partner because
a conformation-dependent property of a ligand can be
independent of interactions that differ between binding
partners so long as the ligand is in the same bound
conformation when bound to the binding partners. Other
binding partners include, for example, nucleic acids,
carbohydrates, and synthetic organometallic complexes.

A method of the invention for identifying a
conformation-dependent property of a ligand can also include
the steps of (a) determining a bound conformation of a
ligand, or portion thereof, bound to two or more
polypeptides; (b) identifying two or more bound
conformations of the ligand, or portion thereof, having
substantially the same bound conformation, and (c)
identifying a conformation-dependent property of the bound
conformations of the ligand, or portion thereof, having
substantially the same bound conformation, the
conformation-dependent property being correlated with the
bound conformation of the ligand, or portion thereof. A
conformation-dependent property of a portion of a ligand can
be identified, for example, by using the methods described
above for identifying a conformation-dependent property of a
ligand.

The invention also provides a method for
identifying a polypeptide pharmacofamily. The method
includes the steps of (a) determining bound conformations of
a ligand bound to different polypeptides of a polypeptide
family, and (b) identifying two or more bound conformations
of the ligand having substantially different bound
conformations, thereby identifying at least two polypeptide
pharmacofamilies exhibiting binding specificity for the two
or more substantially different bound conformations of the
ligand.

A method for identifying a polypeptide
pharmacofamily can include the steps of (a) determining
bound conformations of a ligand bound to different
polypeptides of a polypeptide family; (b) clustering bound
conformations of a ligand having substantially the same
conformations into pharmacoclusters; and (c) identifying a
first polypeptide that binds a bound conformation of a
ligand in one pharmacocluster and a second polypeptide that
binds a bound conformation of a ligand in a second
pharmacocluster as belonging to separate polypeptide
pharmacofamilies.

Polypeptides of a polypeptide family can be
identified by their ability to specifically bind to the same
ligand, or portion thereof. Specific binding between a
polypeptide and a ligand can be identified by methods known
in the art. Methods of determining specific binding
include, for example, equilibrium binding analysis,
competition assays, and kinetic assays as described in
Segel, Enzyme Kinetics, John Wiley and Sons, New York (1975),
and Kyte, Mechanism in Protein Chemistry, Garland Pub.
(1995). Thermodynamic and kinetic constants can be used to
identify and compare polypeptides and ligands that
specifically bind each other and include, for example,
dissociation constant (Kd), association constant (Ka),
Michaelis constant (Km), inhibitor dissociation constant
(Kis), association rate constant (kon) or dissociation rate
constant (koff). For example, a family can be identified as
having members that can specifically bind a ligand with a
Kd of at most 10⁻³ M, 10⁻⁴ M, 10⁻⁵ M, 10⁻⁶ M, 10⁻⁷ M,
10⁻⁸ M, 10⁻⁹ M, 10⁻¹⁰ M, 10⁻¹¹ M, or 10⁻¹² M or lower.

A family of polypeptides that bind a ligand can
contain a pharmacofamily that binds substantially the same
conformation of the ligand, or portion thereof. The methods
can be used to identify any number of pharmacofamilies in a
family according to the number of different bound
conformations of a ligand identified. In cases where two or
more polypeptide pharmacofamilies reside in a polypeptide
family, the pharmacofamilies can be distinguished according
to differences in bound conformations of a ligand bound to
the polypeptides. In this case, a bound conformation of a
ligand can be determined and compared according to the
methods described herein. Polypeptides bound to different
bound conformations of a ligand can be identified as those
that do not show substantial overlap of all corresponding
atoms when bound conformations are overlaid. Thus,
polypeptides that bind different bound conformations of a
ligand can be separated into different pharmacofamilies.
Pharmacofamilies in turn can be identified as containing
polypeptides that bind substantially the same bound
conformation of a ligand (see Examples II and III).
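
The grouping of polypeptides by bound conformation described
above can be sketched as follows. This Python example is an
illustration under stated assumptions rather than the
clustering procedure of the Examples: it substitutes a
distance-matrix RMSD (which needs no overlay step, but cannot
distinguish mirror-image conformations) for an explicit
atom-by-atom overlay, and it uses simple single-linkage
grouping. The function names, the cutoff value and the toy
coordinates are all hypothetical.

```python
import math
from itertools import combinations


def distance_matrix_rmsd(conf_a, conf_b):
    """RMS difference between intra-ligand atom-atom distances.

    Both conformations list the same atoms in the same order.
    """
    if len(conf_a) != len(conf_b):
        raise ValueError("conformations must contain the same atoms")
    pairs = list(combinations(range(len(conf_a)), 2))
    total = 0.0
    for i, j in pairs:
        d_a = math.dist(conf_a[i], conf_a[j])
        d_b = math.dist(conf_b[i], conf_b[j])
        total += (d_a - d_b) ** 2
    return math.sqrt(total / len(pairs))


def group_conformations(conformations, cutoff):
    """Single-linkage grouping of named conformations.

    Two conformations are linked when their distance-matrix RMSD
    is at or below the (assumed) cutoff.
    """
    names = list(conformations)
    parent = {name: name for name in names}

    def find(name):
        while parent[name] != name:
            parent[name] = parent[parent[name]]
            name = parent[name]
        return name

    for a, b in combinations(names, 2):
        if distance_matrix_rmsd(conformations[a], conformations[b]) <= cutoff:
            parent[find(a)] = find(b)

    groups = {}
    for name in names:
        groups.setdefault(find(name), []).append(name)
    return sorted(sorted(members) for members in groups.values())


# Hypothetical four-atom "ligand" bound to three polypeptides.
bound = {
    "polypeptide_A": [(0, 0, 0), (1.5, 0, 0), (3.0, 0, 0), (4.5, 0, 0)],
    "polypeptide_B": [(0, 0, 0), (1.5, 0, 0), (3.0, 0, 0), (4.5, 0.1, 0)],
    "polypeptide_C": [(0, 0, 0), (1.5, 0, 0), (1.5, 1.5, 0), (0, 1.5, 0)],
}
print(group_conformations(bound, cutoff=0.5))
```

Each resulting group plays the role of a pharmacocluster in
this sketch, and the polypeptides named in a group would be
candidates for the same pharmacofamily.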

A pharmacofamily of polypeptides identified by the
methods of the invention can have additional similarities
that correlate with similarities in bound conformation of a
ligand. For example, a polypeptide pharmacofamily
identified by the methods of the invention can consist of
polypeptide members that share characteristics that are
unique to the pharmacofamily when compared to one or more
other polypeptides in a different pharmacofamily of the same
family. Such characteristics can include, for example,
protein fold, evolutionary relatedness, enzymatic activity,
domain structure, subcellular localization, interaction
partners, or participation in a similar metabolic or signal
transduction pathway. A demonstration of a correlation
between ligand bound conformation and another characteristic
of polypeptides in a pharmacofamily is provided in Example
II, which describes correlation of bound conformation of a
ligand with polypeptide structure.

An example of a polypeptide family having multiple
pharmacofamilies that can be identified by the methods of
the invention includes NAD(P)(H) binding polypeptides.
Polypeptide pharmacofamilies identified according to
differences in bound conformations of NAD(P)(H) are
described in Example II and Table 11. Thus, the methods can
be used to identify a polypeptide pharmacofamily selected
from the group consisting of pharmacofamily 1,
pharmacofamily 2, pharmacofamily 3, pharmacofamily 4,
pharmacofamily 5, pharmacofamily 6, pharmacofamily 7, and
pharmacofamily 8.

The invention provides a polypeptide
pharmacofamily, comprising polypeptides that bind to
substantially the same bound conformation of a nicotinamide
adenine dinucleotide-related molecule selected from
pharmacofamily 1, pharmacofamily 2, pharmacofamily 3,
pharmacofamily 4, pharmacofamily 5, pharmacofamily 6,
pharmacofamily 7, and pharmacofamily 8 as listed in Table
11.

Pharmacofamilies 1 through 8 consist of the
polypeptide members provided in Table 11 (see Example II).
The polypeptides in pharmacofamily 1 have the NAD(P)(H)
binding Rossmann fold in common, are all in the NAD(P)(H)
binding Rossmann SCOP superfamily, and fall into the SCOP
families of the amino-terminal domain of
glyceraldehyde-3-phosphate dehydrogenase, the
carboxy-terminal domain of alcohol/glucose dehydrogenase,
the NAD binding domain of formate/glycerate dehydrogenase,
the carboxy-terminal domain of amino acid dehydrogenase, or
the amino-terminal domain of lactate & malate dehydrogenase.

The polypeptides in pharmacofamily 2 have the
NAD(P)(H) binding Rossmann fold in common, are all in the
NAD(P)(H) binding Rossmann SCOP superfamily, and fall into
the SCOP families of the carboxy-terminal domain of amino
acid dehydrogenase, glyceraldehyde-3-phosphate
dehydrogenase, and 6-phosphogluconate dehydrogenase.

The polypeptides in pharmacofamily 3 have the
NAD(P)(H) binding Rossmann fold in common, are all in the
NAD(P)(H) binding Rossmann SCOP superfamily, and fall into
the tyrosine-dependent oxidoreductase SCOP family.



The polypeptides in pharmacofamily 4 have the
heme-linked catalase fold and are in the heme-linked
catalase SCOP superfamily and heme-linked catalase SCOP
family.

The polypeptides in pharmacofamily 5 have the β-α
TIM barrel fold in common, are all in the NAD(P)(H) linked
oxidoreductase SCOP superfamily, and fall into the aldo-keto
reductase SCOP family.

The polypeptides in pharmacofamily 6 are
dihydrofolate reductases that all show the dihydrofolate
reductase fold and fall into the dihydrofolate reductase
SCOP superfamily and family.

The polypeptides in pharmacofamily 7 have the
FAD/NAD(P)(H) binding domain fold in common, are all in the
FAD/NAD(P)(H) binding domain SCOP superfamily, and fall into
the amino-terminal and central domains of FAD/NAD linked
reductase SCOP family.

The polypeptides in pharmacofamily 8 have the
ferredoxin-like fold in common, are all in the
ferredoxin-like SCOP superfamily, and fall into the
NADPH-cytochrome P450 reductase or reductase SCOP families.

Polypeptide pharmacofamilies 1 through 8 were
identified according to binding interactions with bound
conformations of NAD(P)(H) in pharmacoclusters 1 through 8,
as described in Example II. Accordingly, the invention
provides a polypeptide pharmacofamily, comprising
polypeptides that bind to a nicotinamide adenine
dinucleotide-related molecule having a bound conformation
selected from pharmacocluster 1, pharmacocluster 2,
pharmacocluster 3, pharmacocluster 4, pharmacocluster 5,
pharmacocluster 6, pharmacocluster 7, and pharmacocluster 8.

The invention additionally provides a method for
identifying a member of a polypeptide pharmacofamily. The
method consists of (a) determining a conformation-dependent
property of a ligand bound to a polypeptide, and (b)
determining a pharmacocluster having substantially the same
conformation-dependent property as the
conformation-dependent property determined for the bound
ligand, wherein a polypeptide pharmacofamily binds the
ligand in a conformation of the pharmacocluster, thereby
identifying the polypeptide as a member of the polypeptide
pharmacofamily. For example, the method can be used with a
ligand such as a nicotinamide adenine dinucleotide-related
molecule or adenosine phosphate-related molecule (see
Examples II and III).

The methods of the invention allow a new member of
a polypeptide pharmacofamily to be identified based on
correlation of a conformation-dependent property of a bound
conformation of a ligand bound to a polypeptide with a
conformation-dependent property established for a bound
conformation of the ligand bound to another polypeptide in
the same pharmacofamily. Thus, a classification can be made
based on ligand structure without requiring determination of
the bound conformation of the ligand. In one embodiment,
the conformation-dependent property can be a model of a
bound conformation. A bound conformation of a ligand bound
to a test polypeptide can be determined, and the bound
conformation can be compared to a pharmacocluster according
to the methods described herein. Substantial overlap
between the bound conformation of the ligand bound to the
test polypeptide and another bound conformation of the
ligand bound to a polypeptide in a pharmacofamily can be
used to identify the test polypeptide as a member of that
polypeptide pharmacofamily.

In another embodiment, the conformation-dependent
property can be a spectroscopic signal that is correlated
with the conformation of a ligand. A spectroscopic signal
can be measured for the ligand bound to a test polypeptide.
The signal can be compared to a signal correlated with a
bound conformation of a ligand bound to a polypeptide in a
polypeptide pharmacofamily. Substantial similarity between
the two signals indicates that the bound conformation of the
ligand bound to the test polypeptide is substantially
similar to the bound conformation of the ligand bound to the
polypeptides of the pharmacofamily. Thus, the test
polypeptide can be identified as a member of the polypeptide
pharmacofamily.
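
The signal comparison described above can be sketched as a
nearest-reference lookup. In this hedged Python example a
single scalar signal measured for a test polypeptide is
compared with one reference value per pharmacofamily, and the
test polypeptide is assigned only if the closest reference
lies within a tolerance. The function name, the reference
values, the tolerance and the choice of a single scalar
signal are placeholders chosen for illustration and are not
measured data.

```python
def assign_pharmacofamily(measured, references, tolerance):
    """Return the pharmacofamily whose reference signal is closest
    to the measured signal, or None if none is within tolerance.

    measured   -- signal for the ligand bound to the test polypeptide
    references -- dict mapping pharmacofamily name to reference signal
    tolerance  -- largest difference still treated as substantially similar
    """
    best_name, best_diff = None, None
    for name, reference in references.items():
        diff = abs(measured - reference)
        if best_diff is None or diff < best_diff:
            best_name, best_diff = name, diff
    if best_diff is not None and best_diff <= tolerance:
        return best_name
    return None


# Placeholder reference signals (arbitrary units), not measured values.
reference_signals = {"pharmacofamily 4": 2.0, "pharmacofamily 5": 8.0}
print(assign_pharmacofamily(7.4, reference_signals, tolerance=1.0))
print(assign_pharmacofamily(5.0, reference_signals, tolerance=1.0))
```

Returning None for a signal that matches no reference keeps
a test polypeptide from being forced into a pharmacofamily
when its signal is not substantially similar to any of them.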



The invention provides rapid and efficient methods
that can be used in a high-throughput screening format.
High-throughput methods can be useful for identifying a
member of a polypeptide pharmacofamily. In a case where a
conformation-dependent property can be rapidly detected and
processed, automated methods can be created for measuring
samples in rapid succession or measuring multiple samples in
parallel. Automated methods can be used for rapidly
handling samples including, for example, robotic
instruments. A combination of automated sample handling
methods with detection of a conformation-dependent property
can, therefore, be useful in a high-throughput screening
method.



According to the methods of the invention a
compound can be identified that has greater specificity for
the polypeptides of one pharmacofamily than for other
polypeptides in the same family. Such a compound can be
used to identify new members of a pharmacofamily using
a binding assay. For example, a mimetic or analog of a
ligand can be identified that preferentially adopts a
conformation more similar to conformations in a particular
pharmacocluster than those in other pharmacoclusters. Such
a mimetic or analog can be used in any binding assay
capable of detecting interactions with a polypeptide,
including, for example, high-throughput methods.

A member of a polypeptide pharmacofamily can also
be identified by searching a database of bound conformations
of a ligand. For example, a bound conformation of a ligand
that binds to a polypeptide of an identified pharmacofamily
can be used as a query in a 3 dimensional search of a
database containing bound conformations of a ligand.
Overlap between the query conformation and a retrieved bound
conformation of the ligand can be used to identify a
polypeptide bound to the retrieved bound conformation of the
ligand as a member of the same polypeptide pharmacofamily as
a polypeptide that binds the query bound conformation (see
Example I).

The invention also provides a method of modeling
the three dimensional structure of a polypeptide. The
method consists of (a) determining a conformation-dependent
property of a ligand bound to a polypeptide; (b) determining
a pharmacocluster having substantially the same
conformation-dependent property as the
conformation-dependent property determined for the bound
ligand, wherein a polypeptide pharmacofamily binds the
ligand in a conformation of the pharmacocluster, thereby
identifying the polypeptide as a member of the polypeptide
pharmacofamily, and (c) modeling the three dimensional
structure of the polypeptide according to a structural model
of a second member of the polypeptide pharmacofamily.

As disclosed herein, polypeptides in a
pharmacofamily can have similar characteristics including,
for example, similar 3 dimensional structure. Therefore,
the 3 dimensional structure of a polypeptide identified by
the invention as a member of a pharmacofamily can be modeled
using a polypeptide that is in the same pharmacofamily and
for which the structure is known. A variety of methods are
known in the art for modeling the three dimensional
structure of a polypeptide according to the amino acid
sequence of the polypeptide and a structure of a second
polypeptide used as a template. Available algorithms
include, for example, GRASP (Nicholls, A., supra), ALADDIN
(Van Drie et al., supra), INSIGHT98 (Molecular Simulations
Inc., San Diego CA), RASMOL (Sayle et al., Trends Biochem.
Sci. 20:374-376 (1995)) and MOLMOL (Koradi et al., J. Mol.
Graphics 14:51-55 (1996)).

A model of a polypeptide determined by the methods
of the invention can be useful for identifying a function of
the polypeptide. For example, residues of a polypeptide
that are involved in binding can be identified using a model
of the invention. Residues identified as participating in
binding can be modified, for example, to engineer new
functions into a polypeptide, to reduce an intrinsic
activity of a polypeptide, or to enhance an intrinsic
activity of a polypeptide. In another example, a model of a
polypeptide can be compared to other polypeptide structures
to identify similar functions. Exemplary functions that can
be identified from a polypeptide structure include binding
interactions with other polypeptides and catalytic
activities.

The invention also provides a method for
constructing a ligand conformer model by determining an
average structure of the bound conformations of a ligand in
a pharmacocluster. A method for constructing a ligand
conformer model can include the steps of (a) determining
bound conformations of a ligand bound to different
polypeptides; (b) clustering two or more bound conformations
of the ligand having substantially the same bound
conformation, thereby identifying a pharmacocluster, and (c)
determining an average structure of the bound conformations
of the ligand in the pharmacocluster. Additionally, a
method for constructing a ligand conformer model can include
the steps of (a) determining a bound conformation of a
ligand bound to a polypeptide; (b) determining a
pharmacocluster having substantially the same bound
conformation as the bound conformation, thereby identifying
the bound conformation of the ligand as a member of the
pharmacocluster, and (c) determining an average structure of
the bound conformations of the ligand in the
pharmacocluster.



An average structure of the bound conformations of
a ligand in a pharmacocluster can be determined by a variety
of methods known in the art. For example, an average
structure can be determined by overlaying bound
conformations, or portions thereof, and identifying an
average location for each atom. Bound conformations in a
group to be averaged can be overlaid relative to a single
member or relative to a centroid position for each atom.
Algorithms for determining an average structure are known in
the art and include, for example, the OVERLAY routine in
INSIGHT98 (Molecular Simulations Inc., San Diego CA).
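
The averaging step described above can be illustrated with a
short Python sketch. It assumes that the bound conformations
have already been overlaid in a common coordinate frame and
that each lists the same atoms in the same order; the overlay
itself is not shown. The function name average_structure and
the toy coordinates are hypothetical, and the sketch is not a
reproduction of the OVERLAY routine.

```python
def average_structure(conformations):
    """Average location of each atom over pre-overlaid conformations.

    conformations -- list of conformations, each a list of (x, y, z)
                     tuples with identical atom ordering
    """
    if not conformations:
        raise ValueError("at least one conformation is required")
    n_atoms = len(conformations[0])
    if any(len(conf) != n_atoms for conf in conformations):
        raise ValueError("conformations must contain the same atoms")
    count = len(conformations)
    return [
        tuple(sum(conf[i][axis] for conf in conformations) / count
              for axis in range(3))
        for i in range(n_atoms)
    ]


# Two hypothetical two-atom conformations that differ only in y.
cluster = [
    [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)],
    [(0.0, 2.0, 0.0), (1.0, 2.0, 0.0)],
]
print(average_structure(cluster))
```

Because each averaged atom position is computed
independently, the resulting structure need not coincide with
any single conformation in the group.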



The format of a ligand conformer model can be
chosen based on the method used to generate the model and
the desired use of the model. In this regard, a conformer
model can be represented as a single structure. The
resulting structure can be a unique structure compared to
the conformations in the pharmacocluster from which it was
derived. Thus, the conformer model can be a new structure
never before observed in nature. A model represented by a
single structure can be useful for making visual comparisons
by overlaying other structures with the model. A conformer
model can also be represented as a plurality of structures
incorporating all or a subset of the bound conformations in
the pharmacocluster. A model represented by multiple
structures can be useful for identifying a range of minor
deviations in the model.



In yet another representation, the conformer model
can be a volume surrounding all or a subset of the bound
conformations in the pharmacocluster. A model showing
volume can be useful for comparing other structures in a
fitting format such that a structure which fits within the
volume of the model can be identified as substantially
similar to the model. One approach that can be used to fit
a structure to a volume is comparison of equivalent surface
patches using gnomonic projection as described, for example,
in Chau and Dean, J. Mol. Graphics 7:130 (1989). Use of a
gnomonic projection to compare structures is also described
in Doucet and Weber, Computer-Aided Molecular Design: Theory
and Applications, Academic Press, San Diego CA (1996).
Algorithms which can be used to fit a structure to a volume
are known in the art and include, for example, CATALYST
(Molecular Simulations Inc., San Diego, CA) and THREEDOM,
which is a part of the INTERCHEM package and makes use of
an Icosahedral Matching Algorithm (Bladon, J. Mol. Graphics
7:130 (1989)) for the comparison and alignment of structures.
An exemplary method of identifying a binding compound by
searching a database of structures using a gnomonic
projection is provided in Example V.
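
The idea of testing whether a structure fits within the
volume of a conformer model can be sketched in a simplified
form. The Python example below does not implement gnomonic
projection, CATALYST or THREEDOM; as a stand-in it treats the
model volume as the union of spheres of an assumed radius
centered on every atom of the bound conformations in a
pharmacocluster, and it assumes the query structure has
already been placed in the same coordinate frame. The
function name, radius and coordinates are hypothetical.

```python
import math


def fits_within_volume(query_atoms, model_conformations, radius):
    """True if every query atom lies within `radius` of at least one
    atom of the conformations that define the model volume."""
    model_atoms = [atom for conf in model_conformations for atom in conf]
    return all(
        any(math.dist(q, m) <= radius for m in model_atoms)
        for q in query_atoms
    )


# Hypothetical model built from two closely related conformations.
model = [
    [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0)],
    [(0.0, 0.3, 0.0), (1.5, 0.3, 0.0)],
]
inside = [(0.1, 0.1, 0.0), (1.4, 0.2, 0.0)]
outside = [(0.1, 0.1, 0.0), (1.4, 3.0, 0.0)]
print(fits_within_volume(inside, model, radius=0.5))
print(fits_within_volume(outside, model, radius=0.5))
```

A larger radius makes the volume more permissive, so the
radius acts as the threshold for what is treated as
substantially similar in this sketch.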

A conformer model can be useful in querying a
database of polypeptide structures to find other members of
a polypeptide pharmacofamily. For example, a member of a
polypeptide pharmacofamily can be identified by querying a
database of bound conformations of a ligand to identify a
retrieved bound conformation of a ligand that is
substantially similar to the query structure, thereby
identifying a polypeptide bound to the retrieved bound
conformation as a member of the same pharmacofamily as a
polypeptide bound to the query bound conformation. A
conformer model can also be used to identify a new member of
a polypeptide pharmacofamily by querying a database of one
or more polypeptide structures using an algorithm that docks
the conformer model, wherein a favorable docking result with
a retrieved polypeptide indicates that the retrieved
polypeptide is a member of the same polypeptide
pharmacofamily as a polypeptide bound to the bound
conformation used as a query. In the latter mode, a
potential new member of a pharmacofamily from which the
conformer model was derived can be identified. The database
queries described above can be performed with algorithms
available in the art including, for example, THREEDOM and
CATALYST.
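
By way of illustration only, the first query mode described above (retrieving bound conformations substantially similar to a query structure) can be sketched as follows. The sketch is not the THREEDOM or CATALYST algorithm. It assumes that all conformations have already been superimposed in a common coordinate system and list the same atoms in the same order, and the function names, database layout and cutoff value are hypothetical.

```python
import math


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two equally ordered coordinate lists."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("structures must be non-empty and have the same atom count")
    total = sum(math.dist(p, q) ** 2 for p, q in zip(coords_a, coords_b))
    return math.sqrt(total / len(coords_a))


def query_database(query, database, cutoff):
    """Return (polypeptide_id, rmsd) pairs within the cutoff, best match first.

    database maps a polypeptide identifier to the bound conformation of the
    ligand observed with that polypeptide (assumed pre-superimposed).
    """
    hits = []
    for polypeptide_id, coords in database.items():
        value = rmsd(query, coords)
        if value <= cutoff:
            hits.append((polypeptide_id, value))
    return sorted(hits, key=lambda hit: hit[1])
```

A polypeptide returned by such a query would be a candidate member of the same pharmacofamily as the polypeptide bound to the query conformation; the cutoff that defines "substantially similar" is a choice left to the user.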

An advantage of the invention is that a conformer
model can be used to identify a binding compound that is
specific for polypeptides of a pharmacofamily. For example,
the conformer model can be compared to a structure of a
compound or to a bound conformation of a ligand to identify
those having similar conformation. A conformer model can be
further used to query a database of compounds to identify
individual compounds having similar conformations.

A conformer model of the invention can also be
used to design a binding compound that is specific for
polypeptides of one or more pharmacofamilies. The methods
of the invention provide a conformer model that can be
produced according to a cluster of bound conformations of a
ligand that are specific for polypeptides of a
pharmacofamily. A conformer model identified by these
criteria can be used as a scaffold structure for developing
a compound having enhanced binding affinity or specificity
for polypeptides of a pharmacofamily. Such a scaffold can
also be used to design a combinatorial synthesis producing a
library of compounds which can be screened for enhanced
binding affinity for polypeptide members of a pharmacofamily
or specificity for polypeptide members of one pharmacofamily
compared to polypeptide members of another pharmacofamily.
An algorithm can be used to design a binding compound based
on a conformer model including, for example, LUDI as
described by Bohm, J. Comput. Aided Mol. Des. 6:61-78
(1992).

A conformer model need not include all atoms of a
pharmacocluster. Thus, a conformer model can include a
portion of atoms in a pharmacocluster so long as the portion
consists of contiguous atoms of a bound conformation of a
ligand and provides sufficient information to distinguish
one pharmacocluster from another. Thus, a conformer model
can be constructed by overlaying corresponding fragments of
bound conformations of a ligand and obtaining an average
structure according to the methods described above. A
conformer model made from a portion of a ligand can be
advantageous due to its small size compared to a complete
structure of the ligand from which it was derived. A
conformer model based on a portion of a bound conformation
of a ligand can also be used to more efficiently and rapidly
query a database due to a reduced use of computer memory
compared to the memory required to manipulate and store a
structure containing all atoms of the ligand.

The invention provides a ligand conformer model
selected from the group consisting of conformer model 1
having coordinates listed in Table 3C, conformer model 2
having coordinates listed in Table 4C, conformer model 3
having coordinates listed in Table 5C, conformer model 4
having coordinates listed in Table 6C, conformer model 5
having coordinates listed in Table 7C, conformer model 6
having coordinates listed in Table 8C, conformer model 7
having coordinates listed in Table 9C, and conformer model 8
having coordinates listed in Table 10C. Conformer models 1-8
are average structures calculated from pharmacoclusters 1-8,
respectively. The conformer models were determined as
described in Example III and are shown in Figure 4.

The invention also provides a moiety having
coordinates listed in Table 3C, coordinates listed in Table
4C, coordinates listed in Table 5C, coordinates listed in
Table 6C, coordinates listed in Table 7C, coordinates listed
in Table 8C, coordinates listed in Table 9C, or coordinates
listed in Table 10C, or subsets of the respective coordinate
sets thereof. In one embodiment the moiety is not
nicotinamide adenine dinucleotide or nicotinamide adenine
dinucleotide phosphate.

Additionally, the invention provides a method for
constructing a pharmacophore model by constructing a model
that contains one or more selected conformation-dependent
properties of one or more pharmacoclusters. A method for
constructing a pharmacophore model can include the steps of
(a) determining bound conformations of a ligand bound to
different polypeptides; (b) identifying two or more bound
conformations of the ligand having substantially the same
bound conformation; (c) identifying a conformation-dependent
property of the bound conformations of the ligand having
substantially the same bound conformation, the
conformation-dependent property being correlated with the bound
conformation of the ligand; and (d) constructing a model
that contains one or more selected conformation-dependent
properties of one or more pharmacoclusters.

Additionally, a method for constructing a
pharmacophore model can include the steps of (a) determining
bound conformations of a ligand, or portion thereof, bound
to different polypeptides; (b) clustering two or more bound
conformations of the ligand, or portion thereof, having
substantially the same bound conformation, thereby
identifying a pharmacocluster; and (c) determining an
average structure of the bound conformations of the ligand,
or portion thereof, in the pharmacocluster, wherein the
average structure is a pharmacophore model. A method for
constructing a ligand conformer model can also include the
steps of (a) determining a bound conformation of a ligand,
or portion thereof, bound to a polypeptide; (b) determining
a pharmacocluster having substantially the same bound
conformation as the bound conformation, thereby identifying
the bound conformation of the ligand as a member of the
pharmacocluster; and (c) determining an average structure of
the bound conformations of the ligand in the
pharmacocluster, wherein the average structure is a
pharmacophore model.
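
Steps (b) and (c) of the method above can be sketched, under stated assumptions, as a grouping step followed by an averaging step. The grouping shown here is a simple greedy threshold procedure chosen for brevity; it is not asserted to be the clustering algorithm used in the Examples. The sketch assumes pre-superimposed conformations with identical atom ordering, and the threshold and function names are hypothetical.

```python
import math


def rmsd(a, b):
    """RMSD between two equally ordered, pre-superimposed coordinate lists."""
    return math.sqrt(sum(math.dist(p, q) ** 2 for p, q in zip(a, b)) / len(a))


def average_structure(members):
    """Per-atom mean position over a list of conformations."""
    count = len(members)
    return [
        tuple(sum(conf[i][axis] for conf in members) / count for axis in range(3))
        for i in range(len(members[0]))
    ]


def cluster_conformations(conformations, threshold):
    """Step (b): a conformation joins the first group whose current average
    lies within the threshold; otherwise it starts a new group."""
    clusters = []
    for conf in conformations:
        for members in clusters:
            if rmsd(conf, average_structure(members)) <= threshold:
                members.append(conf)
                break
        else:  # no existing group was close enough
            clusters.append([conf])
    return clusters


def build_models(conformations, threshold):
    """Step (c): one average structure per group of two or more conformations."""
    return [
        average_structure(members)
        for members in cluster_conformations(conformations, threshold)
        if len(members) >= 2
    ]
```

Because the grouping is greedy, the result can depend on the order in which conformations are presented; a hierarchical or exhaustive method would avoid that at greater computational cost.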

A pharmacophore model constructed by the methods 
of the invention can be derived from any conformation- 
dependent property that is correlated with a 

pharmacocluster. An example of a pharmacophore model useful 
in the methods of the invention is a conformer model. 
Additionally, a pharmacophore model can include a portion of 
a bound conformation, wherein the portion need not contain 
contiguous atoms of a bound conformation of a ligand so long 
as the pharmacophore model provides sufficient information 
to distinguish one pharmacocluster from another. Thus, a 
pharmacophore model can appear as points in space 
unconnected by any semblance of a covalent bond due to 
absence of intervening atoms. For example, a pharmacophore 
model constructed from a pharmacocluster of nicotinamide 
adenine dinucleotide bound conformations can contain a 
phosphate moiety and nicotinamide ring moiety absent the 
ribose moiety which intervenes in a complete model of the 
structure.

A pharmacophore model can be any representation of
points in a defined coordinate system that correspond to
positions of atoms in a bound conformation of a ligand. For
example, a point in a pharmacophore model can correlate with
the center of an atom in a conformer model. An atom of a
conformer model can also be represented by a series of
points forming a line, plane or sphere. A line, plane or
sphere can form a geometric representation designating, for
example, shape of one or more atoms or volume occupied by
one or more atoms.



A pharmacophore model can be represented in any
coordinate system including, for example, a 2 dimensional
Cartesian coordinate system or 3 dimensional Cartesian
coordinate system. Other coordinate systems that can be
used include a fractional coordinate system or reciprocal
space such as those used in crystallographic calculations
which are described in Stout and Jensen, supra.

In addition to a geometric description of a bound
conformation of a ligand, a pharmacophore model can include
other characteristics of atoms or moieties of the ligand
including, for example, charge or hydrophobicity. Thus, a
pharmacophore model can be a generalized structure, which
includes but does not unambiguously describe the bound
conformations of the ligand bound to the polypeptides in the
pharmacofamily from which it was derived. For example,
atoms can be represented as units of charge such that an
oxygen in a bound conformation of a ligand can be
represented by an electronegative point in the pharmacophore
model. In this example, the electronegative point in the
pharmacophore model includes any electronegative atom at
that particular location including, for example, an oxygen
or sulfur.

A pharmacophore model can be constructed to
include, in addition to characteristics of the ligand
itself, characteristics of an atom or moiety that interacts
with the ligand and is from a bound polypeptide.
Characteristics of an interacting polypeptide atom or moiety
that can be included in a pharmacophore model include, for
example, atomic number, volume occupied, distance from an
atom of the ligand, charge, hydrophobicity, polarity, or
location relative to the ligand. Methods for constructing a
pharmacophore model to include interacting atoms from a
polypeptide are provided in Example III.
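
The generalized representation described above, in which a point of the model carries a property rather than a specific element, can be sketched as a small data structure. The class name, the tolerance field and the property table below are illustrative assumptions; only the oxygen and sulfur examples are taken from the text.

```python
import math
from dataclasses import dataclass

# Hypothetical property table: elements accepted for each named feature.
# Only the oxygen/sulfur example comes from the description above.
FEATURE_ELEMENTS = {"electronegative": {"O", "S"}}


@dataclass(frozen=True)
class FeaturePoint:
    position: tuple   # (x, y, z) in the model coordinate system
    feature: str      # a key of FEATURE_ELEMENTS
    tolerance: float  # allowed distance from position (assumed parameter)


def atom_matches(point, element, position):
    """True if an atom of the given element at the given position satisfies the point."""
    allowed = FEATURE_ELEMENTS.get(point.feature, set())
    return element in allowed and math.dist(point.position, position) <= point.tolerance
```

With this representation a single electronegative point is satisfied by either an oxygen or a sulfur placed near the same location, which is the sense in which the model does not unambiguously describe any one bound conformation.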

A characteristic included in a pharmacophore model
can be incorporated into a geometric representation using
any additional representation that can be correlated with
the characteristic. For example, color or shading
can be used to identify regions having characteristics such
as charge, polarity, or hydrophobicity. As such, the depth
of shading or color or the hue of color can be used to
indicate the degree of a characteristic. By way of
example, a common convention used in the art is to identify
regions of increased positive charge with deeper shades of
blue, areas of increased negative charge with deeper shades
of red and neutral regions with white. Numeric
representations can also be used in a pharmacophore model
including, for example, values corresponding to potential
energy for an interaction, or degree of polarity.
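
The color convention mentioned above can be sketched as a mapping from a charge value to a red, green and blue triple. The linear scaling and the saturation value are assumptions made for illustration and are not part of the convention itself.

```python
def charge_to_rgb(charge, max_abs_charge=1.0):
    """Map a charge to (r, g, b): white when neutral, deeper blue as the
    charge becomes more positive, deeper red as it becomes more negative."""
    if max_abs_charge <= 0:
        raise ValueError("max_abs_charge must be positive")
    depth = min(abs(charge) / max_abs_charge, 1.0)  # 0 = neutral, 1 = saturated
    fade = round(255 * (1.0 - depth))               # shared by the faded channels
    if charge > 0:
        return (fade, fade, 255)
    if charge < 0:
        return (255, fade, fade)
    return (255, 255, 255)
```

The same pattern could be applied to polarity or hydrophobicity by substituting a different numeric scale.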

In addition, a pharmacophore model can incorporate
constraints of a physical or chemical property of the bound
conformations of a ligand in a pharmacocluster. A
constraint of a physical property can be, for example, a
distance between two atoms, allowed torsion angle of a bond,
or volume of space occupied by an atom or moiety. A
constraint of a chemical property can be, for example,
polarity, van der Waals interaction, hydrogen bond, ionic
bond, or hydrophobic interaction. Such constraints can be
included in a pharmacophore model using the representations
described above.

A pharmacophore model can include two or more
pharmacoclusters. In order to identify a ligand having
broad specificity for two or more polypeptide
pharmacofamilies, a pharmacophore model can be derived from
the two or more corresponding pharmacoclusters.
Additionally, in order to identify a ligand that can
preferentially bind a first polypeptide which belongs to a
first polypeptide pharmacofamily compared to a second
polypeptide of a second polypeptide pharmacofamily, a
pharmacophore model can incorporate constraints on geometry
or any other characteristic so as to exclude a
characteristic of the bound conformation of the ligand bound
to the second polypeptide. For example, a geometric
constraint can be a forbidden region for one or more atoms of
a bound conformation of a ligand. A forbidden region can be
identified by overlaying two conformer models in a
coordinate system and identifying a coordinate or set of
coordinates differentially occupied by one or more atoms of
the conformer models. A pharmacophore model incorporating
such a forbidden region will be specific for a polypeptide
of one pharmacofamily over a polypeptide of a second
pharmacofamily, in accordance with the constraint
incorporated.
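
One way to identify a forbidden region of the kind described above is to overlay the two conformer models on a common grid and keep the cells occupied by only one of them. The sketch below makes several simplifying assumptions: atoms are treated as points at their centers, the models are already overlaid, and the cell size and function names are hypothetical.

```python
import math


def occupied_cells(coords, cell_size):
    """Set of grid cells (integer index triples) containing at least one atom center."""
    return {tuple(math.floor(c / cell_size) for c in atom) for atom in coords}


def forbidden_cells(target_model, excluded_model, cell_size=1.0):
    """Cells occupied by the excluded conformer model but not by the target model."""
    return occupied_cells(excluded_model, cell_size) - occupied_cells(target_model, cell_size)


def violates(compound_coords, forbidden, cell_size=1.0):
    """True if any atom center of the compound falls inside a forbidden cell."""
    return bool(occupied_cells(compound_coords, cell_size) & forbidden)
```

A compound that matches the target model and does not violate the forbidden cells would be a candidate for preferential binding to the first pharmacofamily. A more realistic treatment would account for atomic radii rather than atom centers alone.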
An advantage of the invention is that a
pharmacophore model can be created based on multiple
structures of the same ligand. In comparison to a
pharmacophore model derived from a single structure or
different ligands, a pharmacophore model derived from
multiple bound conformations of the same ligand can include
a greater degree of geometric information. For example,
averaging of multiple bound conformations of the same ligand
can provide torsion angle constraints that are not available
from a single structure and not evident from comparing
different ligands.

The invention further provides a method for
identifying a binding compound for one or more members of a
polypeptide pharmacofamily by identifying a compound having
a selected conformation-dependent property of a
pharmacocluster. A binding compound can be any molecule
having selected conformation-dependent properties of a
ligand such that the binding compound can form a complex
with one or more members of one or more polypeptide
pharmacofamilies. A method for identifying a binding compound
for one or more members of a polypeptide pharmacofamily can
include the steps of contacting a ligand with a polypeptide
member of a pharmacofamily; identifying a
conformation-dependent property associated with a bound conformation of
the ligand bound to the polypeptide; comparing the
conformation-dependent property of the bound conformation of
the ligand bound to the polypeptide with a
conformation-dependent property of a bound conformation of a ligand bound
to another polypeptide in the same pharmacofamily; and
identifying a ligand bound to the polypeptide with a
conformation-dependent property similar to a bound
conformation of a ligand bound to another polypeptide in the
same pharmacofamily, thereby identifying a compound that
binds one or more polypeptide members of a pharmacofamily.
A compound that binds to one or more members of a
polypeptide pharmacofamily can be identified by determining
a conformation-dependent property by any of the methods
described herein. For example, a ligand conformation or
spectroscopic signal can provide a conformation-dependent
property useful in identifying a compound that binds to one
or more members of a polypeptide pharmacofamily.

The methods described herein for identifying a
binding compound for one or more members of a polypeptide
pharmacofamily can readily be adapted to a high throughput
screening method. For example, methods of rapidly detecting
a conformation-dependent property in a sequence of samples
or detecting a conformation-dependent property in parallel
samples can be applied to a high throughput screen. One
skilled in the art will know how to adapt the methods
described here to a high throughput screening format using,
for example, robotic manipulation of samples.

A method for identifying a binding compound for
one or more members of a polypeptide pharmacofamily can
include the steps of determining a bound conformation of a
ligand bound to a polypeptide member of a polypeptide
pharmacofamily; comparing the bound conformation of the
ligand bound to the polypeptide member of the polypeptide
pharmacofamily to a pharmacophore model; and identifying the
bound conformation of the ligand bound to the polypeptide
member of the polypeptide pharmacofamily that satisfies the
constraints of the pharmacophore model as a binding compound
for one or more members of the pharmacofamily to which the
polypeptide member belongs.

A pharmacophore model can be useful in querying a
database of polypeptide structures to find other members of
a polypeptide pharmacofamily. For example, a member of a
polypeptide pharmacofamily can be identified by querying a
database of bound conformations of a ligand to retrieve a
structure that fits the constraints of the query
pharmacophore model, thereby identifying the retrieved
polypeptide as a member of the pharmacofamily from which the
pharmacophore model was derived. A pharmacophore model can
also be used to identify a new member of a polypeptide
pharmacofamily by querying a database of one or more
polypeptide structures using an algorithm that docks or
compares the pharmacophore model to polypeptide structures,
wherein a favorable docking or comparison identifies a
polypeptide as a member of the same polypeptide
pharmacofamily from which the pharmacophore model was
derived. The database queries described above can be
performed with algorithms available in the art including,
for example, THREEDOM and CATALYST.

An advantage of the invention is that a
pharmacophore model can also be used to identify a binding
compound that is specific for polypeptides of one or more
pharmacofamilies. For example, a pharmacophore model can be
compared to a structure of a compound or to a bound
conformation of a ligand to identify those having similar
properties. A conformer model can be further used to query
a database of compounds to identify individual compounds
having similar properties.

A pharmacophore model of the invention can also be
used to design a binding compound that is specific for
polypeptides of one or more pharmacofamilies. A
pharmacophore model identified by these criteria can be used
as a scaffold or set of constraints for developing a
compound having enhanced binding affinity or specificity for
polypeptides of one or more pharmacofamilies. Using
similar methods a pharmacophore model can be used to design
a combinatorial synthesis producing a library of compounds
having properties consistent with or similar to the model,
which can then be screened for enhanced binding affinity or
specificity for polypeptide members of one or more
pharmacofamilies. An algorithm can be used to design a
binding compound based on a pharmacophore model including,
for example, LUDI as described by Bohm, J. Comput. Aided
Mol. Des. 6:61-78 (1992).

A compound can be identified as satisfying the
constraints of a pharmacophore model by a variety of methods
for comparing structures. For example, a pharmacophore
model that is a geometric representation such as a conformer
model can be overlaid with a compound, and the best fit
determined as described herein. Substantial overlap between
a compound and a pharmacophore model can be indicated by a
visual comparison and/or a computation-based comparison based
on, for example, RMSD values or torsion angle values as
described above. In a case where a pharmacophore model is
represented by constraints, a compound can be fitted to the
pharmacophore model to identify if the properties of the
compound satisfy the constraints of the pharmacophore model.
For example, if a pharmacophore model contains, as a
constraint, a maximum distance between atoms, a compound
that satisfies the constraint can be identified as having a
distance between corresponding atoms that is no greater than
the maximum value. One skilled in the art will know how to
extend such methods of comparison to any physical or
chemical constraint.
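
The maximum-distance example above can be sketched as a simple check. The sketch assumes that atoms of the compound have already been matched to named atoms of the model; the atom labels, the limit values and the class name are hypothetical.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class MaxDistance:
    atom_a: str   # label of the first atom in the model
    atom_b: str   # label of the second atom in the model
    limit: float  # maximum allowed distance between the two atoms


def satisfies(compound, constraints):
    """compound maps atom labels to (x, y, z); every constraint must hold."""
    return all(
        math.dist(compound[c.atom_a], compound[c.atom_b]) <= c.limit
        for c in constraints
    )
```

Other physical constraints, such as an allowed torsion angle range, could be added as further constraint types that expose the same kind of check.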

A compound can also be identified as satisfying
the constraints of a pharmacophore model by demonstrating
the same characteristics for one or more specific atoms
located within a volume of space defined by the geometric
constraints of the pharmacophore model. For example, in a
case where polarity is a constraint and where a conformation
of a compound can be overlaid with a pharmacophore model, an
atom that overlaps a volume of space indicated by the
pharmacophore and having polarity within the defined limits
can be identified as satisfying constraints of the
pharmacophore. By extension, a compound having atoms which
satisfy all constraints of a pharmacophore is identified as
a binding compound for one or more members of a polypeptide
pharmacofamily from which the pharmacophore was produced.

Therefore, the invention provides a binding
compound identified by the above described methods. For
example, the invention provides a binding compound
identified using a pharmacophore model or a conformer model
derived from a pharmacocluster and/or pharmacofamily.

The invention provides a pharmacophore model
selected from the group consisting of pharmacophore model 1
having coordinates listed in Tables 3B and 3C, pharmacophore
model 2 having coordinates listed in Tables 4B and 4C,
pharmacophore model 3 having coordinates listed in Tables 5B
and 5C, pharmacophore model 4 having coordinates listed in
Tables 6B and 6C, pharmacophore model 5 having coordinates
listed in Tables 7B and 7C, pharmacophore model 6 having
coordinates listed in Tables 8B and 8C, pharmacophore model
7 having coordinates listed in Tables 9B and 9C, and
pharmacophore model 8 having coordinates listed in Tables
10B and 10C.

The invention also provides a medium comprising a
storage medium and, stored in the medium, atomic coordinates
selected from the atomic coordinates listed in Table 3B, 3C,
4B, 4C, 5B, 5C, 6B, 6C, 7B, 7C, 8B, 8C, 9B, 9C, 10B or 10C,
or a subset thereof. In one embodiment the medium comprises
a computer readable medium. The use of a computer apparatus
is convenient since atomic coordinates can be readily
stored and accessed for manipulation including, for example,
docking to a polypeptide structure or comparison to
coordinates for other bound conformations of a ligand.
Exemplary methods for manipulating atomic coordinates are
described above.

It is understood that a computer apparatus of the
invention need not itself store atomic coordinates of the
invention. The computer apparatus contains an algorithm for
viewing a structure from the coordinates or otherwise
manipulating the coordinates. By using various hardware,
software and network combinations, the atomic coordinates
can be manipulated in a variety of configurations. A
separate medium storing the coordinates can be another
computer apparatus, a storage medium such as a floppy disk
or Zip disk, or a server such as a file-server, which can be
accessed by a carrier wave such as
an electromagnetic carrier wave. One skilled in the art
will know or can readily determine appropriate hardware,
software or network interfaces that allow interconnection of
a computer apparatus of the invention.



The methods of the invention described herein can
be performed in a computer apparatus using the atomic
coordinates listed in Table 3B, 3C, 4B, 4C, 5B, 5C, 6B, 6C,
7B, 7C, 8B, 8C, 9B, 9C, 10B or 10C by adding the step of
entering the coordinates or a subset of the coordinates to
the computer apparatus that performs a method of the
invention. One skilled in the art will know or can readily
determine an algorithm instructing a computer apparatus to
carry out the methods of the invention.



It is understood that modifications which do not
substantially affect the activity of the various embodiments
of this invention are also provided within the definition of
the invention provided herein. Accordingly, the following
examples are intended to illustrate but not limit the
present invention.

EXAMPLE I 

Identification of Polypeptide Pharmacofamilies Based on
Bound Conformations of NAD(P)(H) Ligands

This example describes identification of ligand
conformer groups and corresponding polypeptide
pharmacofamilies based on bound conformations of NAD(P)(H)
bound to polypeptide oxidoreductases.




The oxidoreductases form a family of polypeptides
that bind NAD(H) and NADP(H). In order to identify
pharmacofamilies within the family of oxidoreductases, bound
conformations of NAD(P)(H) were determined by searching the
protein databank. Bound conformations from 156 structures
were clustered into separate pharmacoclusters, and
pharmacofamilies were identified according to binding to
bound conformations of NAD(P)(H) in separate
pharmacoclusters.

Structure files containing polypeptides with bound
NAD(P)(H) were identified from the protein databank by
keyword searches using the database software. Keywords
included "NAD," "NADH," "NADP," "NADPH," "oxidoreductase,"
"dehydrogenase" and "reductase." Cluster analysis was
performed using the algorithm COMPARE (Chiron Corp, 1995;
distributed by Quantum Chemistry Program Exchange,
Indianapolis IN) in combination with visual inspection.
All clusters were visually inspected using Insight 98 for
outliers that demonstrated poor overlay with the rest of the
pharmacocluster as a whole. These outliers were compared
against each other and existing pharmacoclusters to find
other possible matches. Those that did not fit any family
were removed. Comparison between bound conformations was
made based on the RMSD equations supplied in COMPARE.

Eight pharmacoclusters were identified by this
method, as shown in Figure 1. Visual inspection of the
clusters in Figure 1 demonstrates that members within a
cluster are substantially overlapped. Comparison between
clusters demonstrates substantial differences. For example,
the bound conformations in cluster 5 have an extended
structure compared to the bound conformations in cluster 4,
which form a horseshoe-like shape. Other differences
include, for example, a flip in the nicotinamide ring
between cluster 1 and cluster 2 such that the nicotinamide
ring is anti to the ribose in cluster 1 and syn to the
ribose in cluster 2, and a change in torsion angle in the
bonds connecting the adenine ribose to the adenine phosphate
for the bound conformations of cluster 3 compared to those
of cluster 2.

The dihedral angles for various bonds in the bound
conformations of the NADP(H) ligand can be used to
distinguish the pharmacoclusters. As shown in Table 1 (see
Figure 2 for atom and bond locations), although many
dihedral angles are similar between two or more
pharmacoclusters, each pharmacocluster can be distinguished
from the others by comparison of the full set of dihedral
angles. For example, pharmacoclusters 2 and 3 can be
distinguished by comparison between the dihedral angles at
O4'A-C4'A-C5'A-O5'A, which are 154 degrees and -131 degrees,
respectively, and by comparison between the dihedral angles
at C5'A-O5'A-PA-O3, which are 105 degrees and 57 degrees,
respectively.
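
For illustration, a dihedral angle of the kind compared above can be computed from the coordinates of the four atoms that define it. The sketch below uses the standard signed-dihedral formula; it is not taken from COMPARE or Insight, and the function names are hypothetical. The second function compares two angles while accounting for the wrap-around at plus and minus 180 degrees.

```python
import math


def _sub(a, b):
    return tuple(x - y for x, y in zip(a, b))


def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def _dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def dihedral_degrees(p1, p2, p3, p4):
    """Signed dihedral angle (degrees, -180 to 180) about the p2-p3 bond."""
    b1, b2, b3 = _sub(p2, p1), _sub(p3, p2), _sub(p4, p3)
    n1, n2 = _cross(b1, b2), _cross(b2, b3)   # normals of the two planes
    x = _dot(n1, n2)
    y = _dot(_cross(n1, n2), b2) / math.sqrt(_dot(b2, b2))
    return math.degrees(math.atan2(y, x))


def angular_difference(a, b):
    """Smallest signed difference a - b between two angles in degrees."""
    return (a - b + 180) % 360 - 180
```

Applied to the values quoted above for O4'A-C4'A-C5'A-O5'A, the two clusters differ by 75 degrees in magnitude once the wrap-around is taken into account, rather than by the 285 degrees suggested by naive subtraction.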
A quantitative analysis of the results of
clustering bound conformations of NAD(P)(H) is provided in
Table 2. Table 2 shows RMSD values calculated from
comparisons between each pharmacocluster's average
coordinates. Average coordinates were determined from the
pharmacocluster subsets listed in Tables 3 through 10 as
described below.




[Table 2. RMSD values calculated between the average
coordinates of pharmacoclusters 1 through 8. The table
entries are not legible in this copy.]

Tables 3A, 4A, 5A, 6A, 7A, 8A, 9A and 10A show
RMSD values for subsets of members of pharmacoclusters 1-8,
respectively. The RMSD values for each member were
calculated as comparisons to an average structure for the
subsets shown in each table respectively. For each
pharmacocluster a subset of the possible ligands that belong
to the cluster was identified. Each subset was chosen to
maximize the diversity of the family and to minimize
over-representation of ligand conformations from enzymes that
exist multiply in the PDB database. The goal of the subset
selection was to fully represent characteristics from
oxidoreductases belonging to a range of species and
catalyzing a range of different reactions. For example,
there exist more than ten alcohol dehydrogenases in the PDB
database; however, for purposes of this study, only three
were chosen from three different species for use in the 3D
overlay and the pharmacophore construction. Average
coordinates for the above described pharmacocluster subsets
were obtained by overlaying ligand structures in MSI
Insight II using the overlay function. The three dimensional
coordinates for each atom in each ligand were used to
calculate an average position and a standard deviation for
the pharmacofamily.
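
The averaging step described above can be sketched as follows. The sketch is not the Insight II overlay function; it assumes the ligand structures have already been overlaid and list their atoms in the same order. Reporting the standard deviation separately for each coordinate axis, and using the population form, is one possible interpretation and is an assumption of this sketch.

```python
import statistics


def atom_statistics(overlaid):
    """Per-atom mean position and per-axis standard deviation.

    overlaid is a list of ligand structures, each a list of (x, y, z)
    tuples in the same atom order. Returns one (mean, spread) pair per
    atom, where mean and spread are (x, y, z) tuples.
    """
    stats = []
    for positions in zip(*overlaid):      # all copies of one atom
        axes = list(zip(*positions))      # xs, ys, zs for that atom
        mean = tuple(statistics.fmean(axis) for axis in axes)
        spread = tuple(statistics.pstdev(axis) for axis in axes)
        stats.append((mean, spread))
    return stats
```

Atoms with a small spread are positioned consistently across the subset, while atoms with a large spread mark the more variable parts of the ligand.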

Comparison of the RMSD values in part A of Tables
3 through 10 with the RMSD values in Table 2 demonstrates
that a member of a pharmacocluster can be identified as
having a lower RMSD compared to an average conformation of
the members in its pharmacocluster than the RMSD between
each family's average coordinates. In some cases it can be
beneficial to combine two or more methods of comparison.
For example, as described above, pharmacoclusters 2 and 3,
which have a relatively low RMSD when compared to each other,
can be distinguished from each other by visual inspection
and by comparison of dihedral angles at various bonds.
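
The membership criterion stated above can be sketched as two small checks against a set of cluster average structures. The sketch assumes pre-superimposed coordinates in a common atom order; the function names and the dictionary layout are hypothetical.

```python
import math


def rmsd(a, b):
    """RMSD between two equally ordered, pre-superimposed coordinate lists."""
    return math.sqrt(sum(math.dist(p, q) ** 2 for p, q in zip(a, b)) / len(a))


def nearest_cluster(conformation, averages):
    """Return (cluster_id, rmsd) for the closest cluster average."""
    return min(
        ((cid, rmsd(conformation, avg)) for cid, avg in averages.items()),
        key=lambda pair: pair[1],
    )


def consistent_with_cluster(conformation, cluster_id, averages):
    """True if the RMSD to the cluster's own average is lower than the RMSD
    between that average and every other cluster average."""
    own = rmsd(conformation, averages[cluster_id])
    return all(
        own < rmsd(averages[cluster_id], other)
        for other_id, other in averages.items()
        if other_id != cluster_id
    )
```

Where two cluster averages lie close to one another, as with pharmacoclusters 2 and 3, an RMSD test of this kind may be inconclusive, which is why the text suggests combining it with dihedral angle comparison or visual inspection.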

These results demonstrate that bound conformations
of a ligand can be grouped into pharmacoclusters by methods
of structure comparison. These results also demonstrate
methods for distinguishing pharmacoclusters and members
within pharmacoclusters.



EXAMPLE II

Correlation Between the Structure of Polypeptides and the
Bound Conformations of NAD(P)(H)

This example describes a correlation between bound
conformations of NAD(P)(H) and structural classification of
polypeptides such that polypeptides of a pharmacofamily have
similar protein fold.

Pharmacoclusters for conformations of NAD(P)(H)
bound to oxidoreductase polypeptides were clustered as
described in Example I. For each polypeptide the protein
fold, SCOP super-family designation and SCOP family
designation were identified from the SCOP website
administered by the Laboratory of Molecular Biology at the MRC,
Cambridge, England (http://mrc-lmb.cam.ac.uk).

Table 11 shows the grouping of NAD(P)(H) binding
polypeptides into 8 pharmacofamilies.



Table 11 (fragment)

SCOP family: Alcohol/glucose dehydrogenase (all 15 rows shown)
SCOP superfamily: NAD(P) binding Rossmann

The results shown in Table 11 demonstrate that
grouping oxidoreductases into pharmacofamilies based on the
bound conformations of NAD(P)(H) yields a correlation with
protein fold. Pharmacofamilies 1-3 consist of polypeptides
having the NAD(P)(H) binding Rossmann fold. Pharmacofamily 4
consists of polypeptides having the heme-linked catalase
fold. Pharmacofamily 5 consists of polypeptides having the
β/α TIM barrel fold. Pharmacofamily 6 consists of
polypeptides having the dihydrofolate reductase fold.
Pharmacofamily 7 consists of polypeptides having the
FAD/NAD(P)(H) binding domain fold. Trypanothione reductase
was added to pharmacofamily 7 by homology of its active site
to the active sites of other members of pharmacofamily 7,
independent of bound ligand conformation. Pharmacofamily 8
consists of polypeptides having the ferredoxin-like fold.
Pharmacofamilies 1 and 2 were identified based on anti or
syn conformation, respectively, of the nicotinamide ring
relative to the ribose. Additionally, a change in the
torsion angles in the bonds connecting the adenine ribose to
the adenine phosphate separates the family members having a
Rossmann fold into a third pharmacofamily, identified as
pharmacofamily 3.
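
The anti/syn distinction used for pharmacofamilies 1 and 2 can be
expressed as a torsion-angle test. The sketch below assumes the
usual nucleoside convention (syn for a glycosidic torsion of
0 ± 90 degrees, anti for 180 ± 90 degrees) and assumes the torsion
is measured over four atoms such as O4'-C1'-N1-C2 of the
nicotinamide riboside; the function names and the test points are
hypothetical, and the atom choice is an assumption rather than
something stated in this example.

```python
import math

def _sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])

def _dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]

def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def dihedral(p0, p1, p2, p3):
    """Signed torsion angle in degrees about the p1-p2 bond."""
    b1, b2, b3 = _sub(p1, p0), _sub(p2, p1), _sub(p3, p2)
    n1, n2 = _cross(b1, b2), _cross(b2, b3)   # normals of the two planes
    b2_len = math.sqrt(_dot(b2, b2))
    y = _dot(_cross(n1, n2), b2) / b2_len
    x = _dot(n1, n2)
    return math.degrees(math.atan2(y, x))

def glycosidic_class(chi_degrees):
    """Label a glycosidic torsion as syn (0 +/- 90) or anti (180 +/- 90)."""
    return "syn" if abs(chi_degrees) <= 90.0 else "anti"

# Idealised geometry: the bond of interest runs along the z axis.
o4, c1, n1 = (1.0, 0.0, 0.0), (0.0, 0.0, 0.0), (0.0, 0.0, 1.0)
chi_eclipsed = dihedral(o4, c1, n1, (1.0, 0.0, 1.0))
chi_trans = dihedral(o4, c1, n1, (-1.0, 0.0, 1.0))
chi_quarter = dihedral(o4, c1, n1, (0.0, 1.0, 1.0))
```

The same dihedral function applies to the torsions between the
adenine ribose and the adenine phosphate that separate
pharmacofamily 3, given the corresponding four atoms.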

The results described in this example demonstrate
that a bound conformation of a ligand can be correlated with
polypeptide fold. Furthermore, the results obtained by the
method are consistent with results obtained by SCOP.
Therefore, classification based on bound conformation of
ligands can be used to classify polypeptides according to
structure.

Example III

Determination of a conformer model and pharmacophore for
pharmacoclusters 1-8

This example demonstrates determination of the
average bound conformations from pharmacoclusters 1-8 and
construction of conformer models based on the average bound
conformations. This example also demonstrates construction
of a pharmacophore model based on the average bound
conformations and interactions with polypeptides.

Conformer models for each pharmacocluster were
produced by determining an average structure for the subset
of members of each pharmacocluster as described in Example
I. The coordinates for conformer models of pharmacoclusters
1-8 are shown in Part C of Tables 3-10, respectively.
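
Averaging a set of bound conformations into a conformer model can
be sketched as below. The sketch assumes the members are already
superimposed and share one atom order, and it uses the population
standard deviation; which form of standard deviation was used for
the Tables is not stated, so that choice, like the function name
average_conformer, is an assumption.

```python
from statistics import fmean, pstdev

def average_conformer(members):
    """Return [(mean_xyz, sd_xyz), ...] per atom for pre-superimposed members."""
    if not members:
        raise ValueError("at least one conformation is required")
    n_atoms = len(members[0])
    if any(len(m) != n_atoms for m in members):
        raise ValueError("all conformations must list the same atoms")
    model = []
    for atom in range(n_atoms):
        # Collect x, y and z separately across all members for this atom.
        axes = [[m[atom][k] for m in members] for k in range(3)]
        mean = tuple(fmean(values) for values in axes)
        sd = tuple(pstdev(values) for values in axes)
        model.append((mean, sd))
    return model

# Two toy two-atom conformations (hypothetical coordinates, in angstroms).
members = [
    [(0.0, 0.0, 0.0), (1.0, 1.0, 1.0)],
    [(2.0, 0.0, 0.0), (1.0, 1.0, 1.0)],
]
model = average_conformer(members)
```

Each entry pairs an average position with its per-axis spread,
which mirrors the X, Y, Z and standard deviation columns described
for Part C of the Tables.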

Pharmacophore models were constructed by aligning
the active sites of a pharmacofamily of oxidoreductases.
Three-dimensional overlays were achieved using the Insight II
overlay module to overlay the NAD(P) ligands of each
enzyme-ligand complex. Heteroatoms in the surrounding protein
that could function as hydrogen bond acceptors or hydrogen
bond donors and that made interactions with the NAD(P)
ligand were identified in each complex. Those heteroatoms
that had common positions in three-dimensional space (within
3 Å of each other in the overlay) in each enzyme complex and
that made a common interaction with the ligand were then
grouped together and tabulated for pharmacophore
construction. Water molecules were similarly identified and
grouped. The grouped heteroatoms and water molecules are
listed in Part D of Tables 3-10 below. Finally, the average
coordinates and the standard deviation for each interaction
group were calculated. The final pharmacophore model was
produced by overlaying interaction groups on the conformer
model (average ligand structure).
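
The grouping step can be illustrated with a simple greedy
procedure. This is only a sketch under stated assumptions: an atom
joins an existing group when it has the same interaction type and
lies within 3 Å of every atom already in that group, and the
standard deviation is the population form. The actual grouping
procedure is not specified beyond the 3 Å criterion, and all names
and coordinates here are hypothetical.

```python
import math
from statistics import fmean, pstdev

CUTOFF = 3.0  # angstroms, the overlay criterion quoted in the text

def group_interactions(atoms, cutoff=CUTOFF):
    """Greedily group (kind, complex_id, xyz) records by type and proximity."""
    groups = []
    for kind, complex_id, xyz in atoms:
        for group in groups:
            close = all(math.dist(xyz, other) <= cutoff
                        for _, other in group["members"])
            if group["kind"] == kind and close:
                group["members"].append((complex_id, xyz))
                break
        else:
            # No compatible group found: start a new one.
            groups.append({"kind": kind, "members": [(complex_id, xyz)]})
    return groups

def summarize(group):
    """Return (kind, atom count, mean xyz, sd xyz) for one interaction group."""
    coords = [xyz for _, xyz in group["members"]]
    axes = [[c[k] for c in coords] for k in range(3)]
    mean = tuple(fmean(values) for values in axes)
    sd = tuple(pstdev(values) for values in axes)
    return group["kind"], len(coords), mean, sd

atoms = [
    ("ACC", 1, (0.0, 0.0, 0.0)),
    ("ACC", 2, (1.0, 0.0, 0.0)),
    ("DON", 1, (0.5, 0.0, 0.0)),
    ("ACC", 3, (10.0, 0.0, 0.0)),
]
groups = group_interactions(atoms)
kind, count, mean, sd = summarize(groups[0])
```

The count returned by summarize corresponds to the total number of
atoms included in each average.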

The coordinates for pharmacophore models of
pharmacoclusters 1-8 are shown in parts B and C of Tables
3-10, respectively. Specifically, each conformer model
includes the average NAD(P) coordinates (in part C of each
Table), and the pharmacophore model includes the average
NAD(P) coordinates, the average water coordinates and the
average protein heteroatom coordinates (that is, the
coordinates in both parts B and C of each Table). An
exception is the pharmacophore model derived from
pharmacofamily 7, which includes average water coordinates
and average protein heteroatom coordinates for all
polypeptides listed but has a conformer model derived from
NAD(P) bound to each polypeptide listed except trypanothione
reductase.

A structural representation of each conformer
model with overlaid interaction groups used to determine
respective pharmacophore models 1-8 is provided in Figure 3.
The structures shown in Figure 3 reflect the average NAD(P)
coordinates shown in Part C of Tables 3-10 and the
coordinates for all interacting groups used to calculate the
average water coordinates and the average protein heteroatom
coordinates as shown in Part D of Tables 3-10. Hydrogen
bond acceptors are labeled with an 'A' followed by a number
for each group. These are listed in the pharmacophore
Tables and designated on the pharmacophore figures. Donors
are labeled with a 'D'; and water molecules are labeled with
a [label illegible].






This example demonstrates construction of
conformer models based on the bound conformations of ligands
in pharmacoclusters. This example also demonstrates
construction of a pharmacophore model based on the bound
conformations of ligands in pharmacoclusters and their
interactions with polypeptides in their respective
pharmacofamilies.

Example IV 

Correlation Between the Bound Conformation of Ligands and a
Conformation-Dependent Property



This example describes a conformation-dependent 
property that is correlated with a bound conformation of a 
ligand. 

A 2D [1H,1H] NOESY spectrum was recorded with a
0.2 ml sample of 1 mM NADP and 200 µM of the enzyme
1-deoxy-D-xylulose 5-phosphate reductoisomerase (DOXP). The
spectrum was measured with a Bruker DRX700 spectrometer
operating at 700 MHz frequency. The total measuring time was
about 12 h.



The spectrum is shown in Figure 4 and atoms are
identified according to Figure 2. The relative intensities
of the observed transferred NOEs (trNOEs) between the ribose
proton H-C1'N (NC1') and the protons on the nicotinamide
ring, H-C4N and H-C2N, shown in Figure 4, reveal that the
NADP adopts a syn conformation when bound to the enzyme.

The bound conformations in pharmacoclusters 1 and 2
can be distinguished according to anti or syn conformation,
respectively, of the nicotinamide ring relative to the
ribose. Therefore, these results demonstrate that the
relative intensities of the observed trNOEs between the
ribose proton H-C1'N (NC1') and the protons on the
nicotinamide ring, H-C4N and H-C2N, can provide a
conformation-dependent property useful in distinguishing
members of pharmacoclusters 1 and 2.
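
Relative NOE intensities translate into relative distances because,
in the isolated spin-pair approximation, cross-peak intensity
scales with the inverse sixth power of the interproton distance.
The sketch below applies only that approximation; it ignores spin
diffusion and exchange effects that matter for transferred NOEs,
and the function name and intensity values are hypothetical rather
than read from Figure 4.

```python
def distance_ratio_from_noe(intensity_a, intensity_b):
    """Return r_a / r_b implied by two NOE intensities, assuming I ~ r**-6."""
    if intensity_a <= 0 or intensity_b <= 0:
        raise ValueError("intensities must be positive")
    # I_a / I_b = (r_b / r_a)**6, so r_a / r_b = (I_b / I_a)**(1/6).
    return (intensity_b / intensity_a) ** (1.0 / 6.0)

# Hypothetical cross-peak volumes from the ribose proton to two ring protons.
ratio = distance_ratio_from_noe(intensity_a=64.0, intensity_b=1.0)
closer_proton = "a" if ratio < 1.0 else "b"
```

Because of the sixth-power dependence, a 64-fold difference in
intensity corresponds to only a two-fold difference in distance,
which is why the stronger of the two cross-peaks is a sensitive
indicator of which ring proton lies nearer the ribose proton.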



Example V

Binding compounds having specificity for one or more
polypeptide pharmacofamilies

This example demonstrates querying a database of
compounds to identify individual compounds having similar
conformations. This example also demonstrates preferential
binding of a compound to a polypeptide of one pharmacofamily
over another.



The TTE0001.001.A07 and TTE0001.002.D02 compounds
were identified by using the THREEDOM algorithm to query a
database of commercially available molecules (ASINEX;
Moscow, Russia) by shape matching with cibacron blue.
Coordinates of cibacron blue were obtained from the
published 3D structure (Li et al., Proc. Natl. Acad. Sci.
USA 92:8846-8850 (1995)). The database was created by
converting an SD format file of structures from ASINEX to
INTERCHEM format coordinates using the batch2to3 program.
Cibacron blue was compared against each structure in the
database in multiple orientations to generate a matching
score. Out of 37,926 structures searched, the 750 with the
best matching scores were selected. From these 750
structures, TTE0001.001.A07 and TTE0001.002.D02 were
selected and purchased based on objective criteria such as
likely favorable binding interactions, pharmacophore
properties, synthetic accessibility and likely
pharmacokinetic, toxicological, absorption and metabolic
properties.
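
The screening logic (score each database structure against the
query in several orientations, keep the best score per structure,
then retain the top-ranked structures) can be sketched generically.
This does not reproduce the THREEDOM algorithm, whose scoring is
not described here; score_fn, the toy "structures" and the
orientations are hypothetical stand-ins.

```python
import heapq

def screen(query, database, orientations, score_fn, keep):
    """Return the `keep` best (score, compound_id) pairs, highest score first."""
    ranked = []
    for compound_id, structure in database.items():
        # A structure is represented by its best score over all orientations.
        best = max(score_fn(query, structure, o) for o in orientations)
        ranked.append((best, compound_id))
    return heapq.nlargest(keep, ranked)

# Toy stand-ins: a "structure" is one number and an "orientation" scales it.
def toy_score(query, structure, orientation):
    return -abs(query - structure * orientation)

database = {"cmpd-1": 1.0, "cmpd-2": 3.0, "cmpd-3": 2.0}
hits = screen(2.0, database, (0.5, 1.0), toy_score, keep=2)
hit_ids = [compound_id for _, compound_id in hits]
```

In the search described above, keep would be 750 and the database
would hold 37,926 entries; the later manual selection on binding,
synthetic and pharmacokinetic criteria is outside this sketch.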

Kinetic studies were carried out in 1-cm cuvettes
in a 1 mL volume at 25°C. Lactate dehydrogenase reactions
were monitored spectrophotometrically with a Cary 300 by
following the decrease in absorbance at 340 nm due to the
oxidation of NADH by pyruvate. Lactate dehydrogenase
reaction mixtures contained 100 mM Hepes buffer at pH 7.4,
as well as 2.5 mM pyruvate, 10 µM NADH and 5 ng/mL lactate
dehydrogenase. NADPH, NADH, Hepes buffer, and rabbit muscle
lactate dehydrogenase were purchased from Sigma. Cytochrome
P450 reductase reactions were monitored by following the
decrease in absorbance at 550 nm due to the reduction of
ferric cytochrome c by NADPH. Cytochrome P450 reductase
reaction mixtures contained 100 mM Hepes buffer at pH 7.4,
as well as 80 µM ferric cytochrome c, 10 µM NADPH, and 80
ng/mL cytochrome P450 reductase. Data were fitted using the
FORTRAN programs of Cleland, Adv. Enzymol. 45:273-387
(1977), which perform nonlinear least squares fits to the
appropriate equations. Substrates were varied around their
Michaelis constants, while the nonvaried substrate was kept
at a concentration close to its Michaelis constant. Values
for the concentration of inhibitor that gives 50% inhibition
(IC50) were obtained by fitting data to the equation for a
line, where Y values are 1/rate and X values are the
concentration of inhibitor, as in a Dixon plot (Segel,
supra). The X-intercept gives the IC50. If a full kinetic
profile was done, then K_is values were obtained by fitting
the data to the equation for a competitive inhibitor:

$$\text{rate} = \frac{V_{max}\,A}{K_a\,(1 + I/K_{is}) + A}$$

where rate is the rate of reaction in units of
absorbance/minute, V_max is the maximum velocity, K_a is the
Michaelis constant for A, K_is is the inhibition dissociation
constant for the inhibitor, I is the inhibitor
concentration, and A is the concentration of NADH or NADPH.
In all cases, the fit to the above equation was used only
after establishing that the fits to equations for
noncompetitive and uncompetitive inhibition were less
appropriate based on values for sigma (overall fit) as well
as standard deviations for fitted constants (K_is and K_ii).
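
The two fitting procedures can be illustrated numerically. The
sketch below is not the Cleland FORTRAN code: it evaluates the
competitive rate equation and fits 1/rate against inhibitor
concentration with an ordinary linear least-squares line. For such
a line the X-axis crossing falls at a negative concentration, so
the function returns its magnitude as the IC50. All constants are
hypothetical and chosen only to exercise the functions.

```python
def competitive_rate(a, i, v_max, k_a, k_is):
    """Rate for a competitive inhibitor: V_max*A / (K_a*(1 + I/K_is) + A)."""
    return v_max * a / (k_a * (1.0 + i / k_is) + a)

def dixon_ic50(inhibitor_concs, rates):
    """Fit 1/rate against [I] to a line; return the magnitude of the X-intercept."""
    xs = list(inhibitor_concs)
    ys = [1.0 / r for r in rates]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
             / sum((x - mean_x) ** 2 for x in xs))
    intercept = mean_y - slope * mean_x
    # The line crosses Y = 0 at X = -intercept/slope; IC50 is its magnitude.
    return intercept / slope

# Hypothetical constants (rate in absorbance/min, concentrations in uM).
V_MAX, K_A, K_IS = 1.0, 10.0, 2.1
A = K_A  # nonvaried substrate held at its Michaelis constant
inhibitor = [0.0, 1.0, 2.0, 4.0, 8.0]
rates = [competitive_rate(A, i, V_MAX, K_A, K_IS) for i in inhibitor]
ic50 = dixon_ic50(inhibitor, rates)
```

For a competitive inhibitor the IC50 obtained this way equals
K_is (1 + A/K_a), so with the substrate held at its Michaelis
constant the IC50 is twice K_is. This is why a full kinetic
profile is needed to report K_is itself.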

As shown in Figure 5, compound TTE0001.001.A07
could inhibit binding of NADH to lactate dehydrogenase and
NADPH to cytochrome P450 reductase, which are polypeptide
members of pharmacofamilies 1 and 8, respectively. Compound
TTE0001.001.A07 demonstrated high binding affinity for both
lactate dehydrogenase and cytochrome P450 reductase.

Analysis of inhibition of binding between NADH and
lactate dehydrogenase is shown in Figure 6. Compound
TTE0001.002.D02 inhibited lactate dehydrogenase with a K_is
of 2.1 µM. Similar measurements of cytochrome P450
reductase with concentrations of compound TTE0001.002.D02 up
to 0.5 mM did not indicate inhibition. These results
indicated that compound TTE0001.002.D02 had a K_is of greater
than 1 mM with cytochrome P450 reductase. Thus, compound
TTE0001.002.D02 demonstrated preferential binding for
pharmacofamily 1, having an inhibitory dissociation constant
(K_is) that was at least 500-fold lower than for
pharmacofamily 8.

The results described in this example demonstrate
that a binding compound can be identified by structural
comparison to a bound conformation of a ligand.
Furthermore, the results demonstrate that binding compounds
that interact with polypeptides from multiple
pharmacofamilies, or compounds that preferentially bind to
polypeptides of one pharmacofamily compared to polypeptides
of another pharmacofamily, can be identified by structural
comparison to a bound conformation of a ligand.

Example VI

Identification of a ligand using a pharmacophore model 

This example demonstrates construction of a
pharmacophore model, use of the model to identify a binding
ligand and confirmation of the ability of the identified
compound to bind a polypeptide member of the pharmacofamily
from which the pharmacophore model was derived.

Pharmacophore models were constructed to include
part or all of the NAD(P) shape, hydrogen bond donors,
hydrogen bond acceptors and/or other chemical features
described in Tables 3-10. The combinations of chemical
features for each search pharmacophore in a search set were
chosen in an attempt to cover a diverse range of
combinations of possible chemical interactions and to
represent the protein-ligand interactions that occur most
frequently in the particular pharmacofamily.



Pharmacophore shape was derived using the program
CATALYST, and was calculated using the van der Waals surface
for part or all of the structure of the averaged NAD(P)
coordinates determined for a pharmacocluster. Desired
hydrogen bonding features, water molecules and other
chemical motifs were positioned in the pharmacophore model
using the average coordinates determined for both the
pharmacofamily and pharmacocluster.



The components of a pharmacophore model derived
from the coordinates presented in Table 3 for pharmacofamily
1 are shown in Figure 7. Figure 7A shows the structure for
the conformer model having coordinates listed in Table 3C
with a superimposed volume defining the shape of the ligand
and indicated by grey spheres. A hydrophobic feature was
added to the pharmacophore model at the average position of
the hydrophobic region of the nicotinamide ring as shown in
Figure 7B. Also shown in Figure 7B is a hydrogen bond
acceptor positioned at the average coordinates for the
pyrophosphate using the averaged coordinates for the
location of hydrogen bond acceptors utilized in all of the
17 polypeptides of the pharmacofamily. Finally, Figure 7B
shows a hydrogen bond donor positioned according to a
position where a hydrogen bond donor of a ligand would be
expected to have favorable interactions with hydrogen bond
acceptors observed in 11 of the polypeptides of
pharmacofamily 1. Thus, the hydrogen bond donor does not
identify a position of an actual hydrogen bond donor in the
NAD(P) ligand, but instead a location where a potential
ligand's hydrogen bond donor could make favorable
interactions with the polypeptides of pharmacofamily 1.
Figure 7C shows the combined features of Figures 7A and 7B
present in a pharmacophore model used to search a database
of compounds.



To identify potential ligands that bind to
polypeptides of pharmacofamily 1, computational searches
were conducted using CATALYST. Searches were made by
comparing the shape and combination of chemical features of
the pharmacophore model, shown in Figure 7, to the shape and
features of molecules in the database.
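
The feature-comparison part of such a search can be sketched as a
distance test. This is not the CATALYST search procedure: it
assumes the candidate has already been aligned into the frame of
the pharmacophore, it omits the shape comparison entirely, and the
1.5 Å tolerance, the feature labels and the coordinates are all
hypothetical.

```python
import math

def matches(pharmacophore, candidate, tolerance=1.5):
    """True if every model feature has a same-type candidate feature nearby."""
    return all(
        any(kind == c_kind and math.dist(xyz, c_xyz) <= tolerance
            for c_kind, c_xyz in candidate)
        for kind, xyz in pharmacophore
    )

# Hypothetical model with one hydrophobe, one acceptor and one donor.
model = [
    ("HYD", (0.0, 0.0, 0.0)),
    ("ACC", (5.0, 0.0, 0.0)),
    ("DON", (5.0, 4.0, 0.0)),
]
good_candidate = [
    ("HYD", (0.4, 0.2, 0.0)),
    ("ACC", (5.5, -0.3, 0.1)),
    ("DON", (4.6, 4.5, 0.0)),
]
poor_candidate = [
    ("HYD", (0.4, 0.2, 0.0)),
    ("ACC", (5.5, -0.3, 0.1)),
    ("ACC", (4.6, 4.5, 0.0)),  # right place, wrong feature type
]
is_hit = matches(model, good_candidate)
is_miss = matches(model, poor_candidate)
```

A practical search would also sample candidate conformations and
alignments before applying a test of this kind.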



An example of a compound identified using the
pharmacophore model shown in Figure 7C is TTE0008.025.D05.
Using a binding assay similar to that described in Example
V, compound TTE0008.025.D05 was shown to have inhibitory
activity against a pharmacofamily 1 member,
dihydrodipicolinate reductase (IC50 = 2.8 µM).

Coordinates for the conformer and pharmacophore
models and data used in their construction are presented in
Tables 3-10 above. Part A of each Table lists the subset of
structures used in constructing the model, including molecule
numbers for cross-referencing between parts A-C, the PDB
accession number, the name of the polypeptide, and the RMSD
from the pharmacocluster average. Part B of each Table
lists the average coordinates for heteroatoms and waters of
the pharmacophore model and includes the atom name (cross
referenced to part D), designation of interaction ("ACC,"
acceptor; "DON," donor; and "WAT," water), total number of
atoms included in the calculation of the average, and X, Y,
Z coordinates with respective standard deviations (σ). Part
C of each Table lists the coordinates of the conformer model
using the atom designations of Figure 2 and X, Y, Z
coordinates with respective standard deviations (σ). Part D
of each Table lists the coordinates for interacting
molecules used to determine the pharmacophore model,
including the atom name, residue molecule # (which
identifies the residue type and molecule number
cross-referenced to Part A), residue number from the PDB
structure, total number of atoms summed for the average
coordinates, and X, Y, Z coordinates with respective
standard deviations (σ). The bolded entries in part D
correspond to the average values reported in part B. Atom
names are identified according to IUPAC recommendations as
described, for example, in Markley et al., Pure and Appl.
Chem. 70:117-142 (1998).

Throughout this application various publications
have been referenced. The disclosures of these publications
in their entireties are hereby incorporated by reference in
this application in order to more fully describe the state
of the art to which this invention pertains.

Although the invention has been described with
reference to the disclosed embodiments, those skilled in the
art will readily appreciate that the specific details are
only illustrative of the invention. It is understood that
modifications which do not substantially affect the activity
of the various embodiments of this invention are also
included within the definition of the invention provided
herein. Therefore, it should be understood that various
modifications can be made without departing from the spirit
of the invention. Accordingly, the invention is limited
only by the following claims.



